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SEQUENCE LISTING ^MRffKl 2 2 OEC 2KB 

SEQ ID NO: 1 (Haemophilus aegyptius) 

MKRNLLKQSVIAVLIGGTTVSNYAIAQAQRQAQVKKDELSELKKQVK^ 

KTIAEKAKGDSSEALDKIE^LEEQNDEFLADITALEEGVDGLDDDIAGIQDNISDIEDDINQNSADIATNTMIATHTQRLDNLDNRV 
NNLNKDLKRGLAAQAALNGLFQPYNVGKLNLTAAVGGYKSQTAVAVG 

SEQ W NO: 2 (Escherichia coli) 

MKTVNVALLALIISATSSPWIAGDTIEAM 
DPDAGQKTVMTGDVQTTARYQSMINMQSAVTDAQQTQITEQ 

GQKMNALTTDVAVQQQNERTQYDKQMQSLAQESAQAHEQIDSLSQDVTQTHQQLTNTQKRVADNSQQINTLNNHFSSLKNEVDDNRKE 
ANAGTASAIAIASQPQVKTGDVMMVSAGAGTFNGESAVSVGTSFNAGTHTVLKAGISADTQSDFGAGVGVGYSF 

SEQ ID NO: 3 (EHEQ 

MNKI FKVI WNPATGNYTVTSETAKSRGKKSGRSKLLI SALVAGGMLSSFGALANAGNDNGQGVDYGSGSAGDGWVAIGKGAKANT FMNTSGS ST AVG 

YDAIAEGQYSSAIGSKTHAIGGASMAFGVSAISEGDRSIALC^SSYSLGQ 

GDTANASKAYSMAL(^SSVASEENAIAI(MTEAAENAT^^ 

GNGSQSRGLNTIALGTASNATGDKSIJUjGSNSSMGINSVALGADSIADLDNTVSVGNSSLKRKIVNVKNGAIKS 

GGGAAVDVDDGTmPTYNLKNGSKNNVGAALAVLDENTLQWDQTKGKYSAAHGTSSPTASVITD 

IATNTSNIATNTANIATNTTNITNLTDSVGDLQADALLWNETKKAFSAAHGQDTTSKITNVKDADLTADSTDA 

SNIATN™ISNLTETVTNIjGEDALKWDKDN^ 

LGEDALKWDKDNGVFTAAHGNNTASKITNILDGTV^ 

STSLGDALLWDATAGKFSAKHGTNGDASVITDVADGEISDSS^^^ 

TTLDDALLWDADAGENGAFSAAHGKDKTASVITNVANGAI S AASSDAINGSQLYTTNKY I ADALGGDAEVNADGT I TAPTYTIANAEYNN VGDALDA 
LDDNALLWDETAHGGAGAYNASHDGKAS I ITNVANGSI SEDSTDAVNGSQLNATNMMI EQNTQI INQLAGNTDATYIQENGAGI NYVRTNDDGLAFN 
DASAQGVGATArGYNSVAKGDSSVMGQGSYSDVDTGI^ 

EAHDAVTVRQLQNAIGAVATTPTK YFHANSTEEDSLAVGTDSLAMGAKTI VNGDKGIGI GYGAYVDANALNGIAI GSNAQVI HVNS I AI GNGSTTTR 
GAQTNYTAYNMDAPQNSVGEFSVGSADGQRQITNVMGSADTO 

KTNTDGVDASAQGKDSVAIGSGS I AAADNSVALGTGSVATEENTI SVGS STNQRRITNVAAGKNATDAVNVAQLKS SEAGGVRYDTKADGS I DYSNI 

TLGGGNGGTTRISNVSAOTNNDVWYAQLKQSVQETKQYT^ 

IiGVSMVSANGRWVYKLQGSTNSQGEYSAALGAGIQW 

SEQ ID NO: 4 (Actinobacillus actinomycetemcomitans) 

MTYQLFKHHLVALMVTGAISVNALMDSFLENPSML^ 
QELVNRARITKNELDIRKNTKSIAEbTTA^^ 
AMSRAIATFRSSSQNIAALTTKVDRNTARIDRLDSRVNEL^ 
RFNQNVAAKAGVAVSTNGGSATYNVGLNFEW 

SEQ ID NO: 5 (Haemophilus somnus) 

MKKVQFFKYSSLALALGLGVSASALMPTSTSTm 

GWNEVKSLPRIDGNGKDKQTKDQIMLIRTVDOTK^ 

DFVEDVNESEARNISRIDENEKDINTLKELMDEDLNSV^ 

DMKQETEDDIADNAKAIHSNTKGIMNTKDIRDLDTKTKQML^ 

EQAIRQNTAGLVWNKRVDTLDKNTKAGIASAVALGML^ 

ATFGGSVGFFFN 

SEQ ID NO: 6 (Haemophilus ducreyi) 

MKIKCLVAWGLACSTITTMAQQPPKFAGVSSLYSYEYDYGKGKWTWSNEGGFDIKVPGIKMKPKEWISKQATY 

TSAPDVSPSSISILLYPMSDPDQLGINRQQLKLNLYSYFNDLRHDFKLKVLDARISKNKQNIDTISKYLLELGTYLDGSYRMMEQNTH 

NINKNTHNINKNTHNINKLSKELQTGLMQ 

YGASVGYEF 

SEQ ID NO: 7 (EPEC) 

MTVNVALLALIISATSSPFVLAGDTIEAMTELSAINSGMSQSEIEQKITRFLERTDNSPMYTYLTEHHYIPSETPDTTQTPPVQ^ 
DPDAGQKTVAATGDVQTTARYQSMINARQSTVTDAQQTQITEQQAQIVATQKTIJ^TGDTQNTAHYQEMINARLMQNEANQRTTTEQ 
GQKMNALTTDVAAQQQKERAQYDKQMQSLAQKSVQAHEQIESLRQDSAQTQQQLTNTQKRVADNSQQINTLNNH 
ANAGTASAIAIASQPQVKTGDLMMVSAGAGTFNGESAVSVGTSFNAGTHTVLKAGISADTQSDFGAGVGVGYSF 
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SEQIDNO:8(EAEC) 

MKTVKLSL3AVWATAVSPSAFAGDTVEMTTELTVIQPGMSQSEIDQKIGRFLERTGNSVMQNYLIAHDYQTTTPQEN 
TNTLNPITNQAQTDRDNGQDTAIQDAQHAANWASLKADDAQHAITVAQTDIDANTMITDTMDVSAVQSDVTNIKG 
MANMTALINGVKLSGAVTENKNNIEQNRSDIADQQKLLASNEQKQIVRDNGQDTAIQDAQHAANWASLKADDAQHAITVAQTDIDA 
NKAAITDIRNDVSAVQSDVTNIKGDVAHAQSTADHANANANTALMNGW 

QDTAIQDAQHAANWASMKADDAQHAITVAQTDIDANKMIADTRNDVSAVQSDVTNIKGDVAHAQSTADHMANANTAL 
VTENKNNIEQNRSDIADQQQQLDETRKIVMTGDVQTAARYQSMIDARQTAAANAQQAQADTQQQQM 

TNAHYQEMVNAGLRAQNDMARTAAEQKQKIDTLATNQATQQHINSVQYGEQIQRLAQDSTQTHEQIDSLTQDVTQTHQQLSNTQm 

ADNSQQimNNHFSSLKNEVEDNRKEMAGTASAIAIASQ 

SDFGAGVGVGYSF 

SEQWNO:9(EAEC) 

MNKI FKVIWNPATGS YT VASETAKSRGKKSGRS KLLI SALVAGGMLS S FGVQAQAGRDNGQGVNYGQGTGTGWVAI GEDAKANS FT DT 
(&GSSTAVGYHSTADGRWSTALGAKTHSLGEAS 

AIAFGRESKALGIMSIALGATMASKEYAMALGASSAASAAMIAVGRNSAAAGVDSMFGRQSA^ 
TNAEMGLNSIALGSGSIADVDNTIALGNQSQAVMGAIAIGQGNKATO 

AQGKNSVALGAGSVATDADTVSVGNTTAQRQIVNMAAGDISTTSTDAINGSQLYAISKSVADNLGGGATVNAQGWTSPNYRLKSGIF 

GTVGDALTGLDNNTLQWDSLKKAYSAMGTDTTSTITNVKDGAISDTSKDA^ 

KDDALLWNGTAFSAAHGTEATSKITNVKDGDLT^ 

ATSKITNVKDGDLTAGSTDAVNGSQLKTTNDAVAm 

GDLTAGSTDAVNGSQLKTTNDAVAMTTNIATNTTNITNLTDAVDSLGDDSLLWNATAGAFSAAHGTDATSKITNVKDGDLTAGSTDA 

VNGSQLKTTNDAVAANTTNIATNTTNITNLTDAVDSLGDDSLLWNATAGAFSAKHGTNGTDSKITNLLAGTVSSDSTDAINGSQLYGL 

ADSFTSYLGGGADISDAGVLTGPTYTIGGTDYNNVGDALAAINTSFSTSLGDALLWDATAKGGDGAFSAGRGTDNTASIITNVADGAI 

SSTSSDMNGSQLYDTSKYIADTLGGDAEVNADGTITAPTYAIAGGSYSNVGDALEAIDTTLDDALLWDATANDGNGAFSAAHGKDKT 

ASVITOTANGAISATSSDAINGSQLYTTNKYIADALGGDAEVNADGSITAPTYTIANAEYNNVGDALDALDDNALLWDATANDGAGAY 

NASHDGKASIITNVADGNIGEGSTDAINGSQLFNTNMLIQQNSEIINQLAGNTSETYIEDNGAGINYVRTNDNGLAFNDASASGIGA 

AVGYNAVASGESSVAIGQGSSSWDTGIALGSSSVSSRVIVKGSRDTSVSEEGWIGYDTTDGELLGALSIGDDGKYRQIINVADG 

AHDAVTVRQLQNAIGAVATTPTKYFHANSTEEDSLATC^ 

GNGSQTTRGAQTGYMYNMDAPQNSVGEFSVGSEDGQRQITNVMGSADTDAVNVGQLKVTDAQVSQNTQSITNLNNQVTNLDTRVTN 
IENGIGDIVTTGSTKYFKTNTDGVDMAQGKDSVAIGSGSIAMDNSVALGTGSVANEENTISVGSSTNQRRITNVAAGVNATDAVNV ■ 
SQLKSSEAGGVRYDTKADGSVDYSNITLGGGNGGTTRISNVSAGVNNNDAVNYAQLKQSVQETKQYTDQRMVEMDNK^ 
IASAMMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSANGRWVYKLQGSTNSQGEYSAALGAGIQff 

SEQ ID NO: 10 (UPEC) 

MNKI FKVIWNPATGS YTVASETAKSRGKKSGRSKLLIS 
GASTALGYDAIAEGEYSSAIGSKTLATGGASMAFGVSAKAMGDRSVM^ 

ALGNTAKAYEIMSIMGDNMASKEYAMALGASSKAGGADSLAFGRKSTANSTGSLAIGADSSSSNDNAIAIGNKTQALGVNSMALGN 

ASQASGESSIALGNTSEASEQNAIALGQGSIASKVNSIALGSNSLSSGENAIALGEGSMGGSNSMFGSQSRANGNDSVAIGVGAAA 

ATDNSVAIGAGSTTDASNTVSVGNSATKRKIVNMMGAISNTSTDAINGSQ 

NVGDALSGIDNNTLQraKTAGAFSMHGMATNKITNVAKGTVSATSTDVWGSQLYDLQQDALLW^ 

GNLTAGSTDAVNGSQLKTTNDNVTTNTTNIATNT^^ 

VNGSQLKTTNDNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSMHGTDATSKITNVTAGNLTAGSTDAVNGSQ 
DNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGSTDAVNGSQLKTTNDNVSTNTTNI 
TNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGSTDAVNGSQLKTTNDNVSTNTTNITNLTDSVGDLKDDSLLW 
NKAAGAFSAAHGTEATSKITNLIAGKISSNSTDAINGSQLYGVADSFT 

FSTSLGDALLWDATAGKFSAKHGINNAPSVITDVANGAVSSTSSDAINGSQLYGVSDYIADALGGNAWNTDGSITTPTYAIAGGSYN 
NVGDALEAIDTTLDDALLWDTTANGGNGAFSAAHGKDKTASVITNVANGAVSATSNDAINGSQLYSTNKYIADALGGDAEVNADGTIT 
APTYTIANTDYNNVGEALDALDNNALLWDEDAGAYNASHDGNASKIT^ 

SETYIEENGAGINYVRTNDSGLAFNDASASGIGATAVGYNAVASHASSVAIGQDSISEVDTGIALGSSSVSSRVIVKGTRNTSVSEEG 
WIGYDTTDGELLGALSIGDDGKYRQIINVADGSEAHDAVTVRQLQNAIGAVATTPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGN 
AGIGIGLNTLVLADAINGIAIGSNARANHADSIAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFSVGSEDGQRQITNVAAGSADTDAV 
l^GQLmDAQVSQNTQSITNLNTQVTNLDTRVTNIENGIGDIVTTGSTKYFKTNTDGADANAQGKDSVAIGSGSIAAADNSVALGTG 
SVADEENTISVGSSTNQRRITNVAAGVNATDAVNVSQLKSSEAGGVRYDTKADGSIDYSNITLGGGNSGTTRISNVSAGVNNNDAVNY 
AQLKQSVQETKQYTDQRMVEMDNKLSKTESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSANGRWVYKLQG 
STNSQGEYSAALGAGIQW 
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SEQ ID NO: 11 (Shigella flexneri) 

MTNLGEDALKWDKDNGVFTAAHGTETTSKI 

WDKDNGVFTMHGNNTASKITNILDGTVTATSSDAINGSQLYDLSSNIATYFGGNASVNTDGVFTGPTYKIGETNYYNVGDALAAINS 
SFSTSLGDALLWDATAGKFSAKHGTNGDASVITDVAIX5EISDSSSDAVNGSQLHGVSSYWDALGGGAEVNADGTITAPTYTI 
DNVGDALl^IDTTPDDALLWDADAGENGAFSAAHGKDKTASVITNVMGAISAASSDAINGSQLYTTNKYIADALG 
APTYTIANMYNNVGDALDALDDNALLWDKTANGGAGAYM^ 

AGNT DATYI EENGAG I NYVRTNDNDLAFNDASAS GVGATAVG YNAVASGAS S VAIGQNS S STVDTGI ALGS SS VS SRV IAKGS RDT S V 
TENGWIGYDTTDGELLGALSIGDDGKYRQIIOTADGSEAHDAV^ 

VNGNAGIGIGLNTLVMDAINGIAIGSNARANHMSIMGNGSQTTRGAQTGYTAYNMDAPQNSVGEFSVGSEDGQRQITNVAAGSAD 
TDAVNVGQLKVTDERVAQNTQSITNLNNQVTNLDTRVTNIENGIGDIVTTGSTKYFKTNTDGVDANAQGKDSVAIGSGSIAAADNSVA 
LGTGSVAEEENTISVGSSTNQRRITNVMSVNAT^ 

AVNYAQLKQSAQETKQYTDQRMVEMDNKLSKTESKLSGGIASAMMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSANGRW^ 
KLQGSTNSQGEYSAALGAGIQW 

SEQ W NO: 12 (Brucella melitensis) 

MS FFKKNI SITMGGMLSLAVDAAKAEENVSQm 
. LISENRQVINNNTTAIGQNRTSISTNMGVADNRMIRQNSMISALGQRVDGLQGQINSARKEARAGAANAAALSGLRY 
IATGVGGFKGSTALAAGIGYTSKNENARYNVSVAYNEAGTSWNAGASFTLN 

SEQ W NO: 13 (Brucella suis) 

MSFFKKNISITMGGIMSIAVDMKAEENVSGVK^ 

LISENRQVINNNTTAIGQNSDRI DANAKGVADNRAAIGQNSGRI DANAKGVADNKAAIGRNSGRI DANAKGVADNKTAIGRNSGRI DT 
NAKG VADNRAAI SQNRGRI NANAAGVASNRAAI RQNSAAI SALGQRVDGLQGQI NSARKEARAGAANAAALSGLR YDNRPGKVS I ATG 
VGGFKGSTALAAGIGYTSKNENARYNVSVAYNEAGTSWNAGASFTLN 

SEQ ID NO: 14 (Ralstonia solanacearum) 

MVFSMPQYACAMLLQNDPGTNCGSVGDAYAWARGDGYSGC^ 

LDSMNIGGHLDVWGASGFHGGVDMNNSAIKNLAD^ 

LDGDAVDFSGKKLSDSTTFSRKLTGVAEGTLSATSTDAVSGKQ^ 

ISGGSAGLVQQSAAGKDITVAKDLDGEAVDFSGKKLSDSTTFSRKLTGVAEGTLSATSTDAVSGKQLYTTNQNLASTNKDLANTNTRL 

TTAEGNLSSNTTSITNLQNTIKNISGGSAGLVQQSAAGKDITVAKDLDGDAVDFSGKKLSDSTTFSRKLTGVAEGTLSATSTDAVSGR 

QLYTTNQNLSTTNQNLADTNKSLAETNKNVSAT^ 

SRKLTGVAEGTLSATSTDAVSGKQLYTTNQNLSTTNQNIA^ 

VAKNLDGDAVDFSGKKLSDSTTFSRKLTGVAEGTLSATSTDAVSGKQLYTTNQNMSTNKDLMTNTRLTTAEGNLSSNTTSITNLQN 
TIKNISGGSAGLVQQSAAGKDITVAKDLDGDAVDFSGKNLSDSTTFSRKLTGVAEGTLSATSTDAVSGKQLYTTNQNLSTTNQNLADT 
NKSLAKTNNNVSATTTNITNLQNTVNNISSGSAGLVQQSMGKDITVMDLDGDAVDFSGKKLS 
VSGKQLYATNQOTSKLSANVTDVSDSVTNIKNTMNT 

NAVALGAGS VADRANTVS VGSAGKERQ ITNVAAGTADT DAVNVAQLKAAGI I NGSGRTNAT VT YGTNADGSADYGN VTLGGGNAPAGT 
AIHNVMGTAETDAVNVRQMNMIASVQKVSNTNDP^ 

ANAKATANHAVAVGSGSVMRMTMSVGSAGSERQITNVMGVQGTDAVNVSQLSQAVYMVGDLPAGTTARQYTDEQIG 
VARGAYSGIAAATALTMIPDVDQGKSIAIGIGSATYKGYQAVALGASARISHNLKAKMGVGYSSEGTTVGMGASYQW 

5 SEQ ID NO: 15 (Sinorhizobium meliloti) 

MALGRQSVSAGSGSLAFGNGSYANSNGSVAIGQSAYAANVRAIAIGG 

EADPGGASDAIAIGTDAQANGDRSLAIGRQNQAGNEQS IGIGAGNTATGKLS IGIGSSNVASGEQSLSLGAGNNALGQGS I S IGTETT 

AGGLRSIAFGVRASTKEMLDIPDDVMIDAIAIGTNTKANGDRSVSIGTGSQASSGAVSIGDAAKAVGDKSVSIGTESWADGDESVS 

IGLVNNAGFEGNDRIRGGQTSVSLGAFNQS PGIEAI AIGARNEANADRS IAIGSRAKTKAADPAQADGGARDAVAIGTDALANDDRS I 

SIGIWSSTSLNDSISIGTRATSGSAGDIMIGTGSGTGSTSGQNNVALGVAASQKVKGSSNIAIGDSAGGSREGDNNVAIGTNAGIQFS 

ESEHETAVRADLWSDAVSIGNEALASADEAIAIGTGAVASGLKSISIGVGNTVSGASSGAIGDPTDITGTGSYSLGNDNTIAADNAG 

TFGNDNTLADAADGSRVIGNGNNIDVSDAFVLGNGADVTEVGGVALGSGSVSDTGADVAGYVPGGASTADQNAIEATQSTRGAVAVGN 

PDAETGVYRQITGVAAGTADSDAANVAQLKSVETIAKTGWKLTTDSGSIDGIGPGDELVLKGGDGNIVISNQILSNDVSIDIA 

NRVTARDPDTGASTVLDENGLSFTTQDMGEDTALGPRVTMGIQMGKITNVMGEADTDAVNFSQLRQVETASGNTDQRAVKYDWT 

DMTNGVIDEGELNLDSVTLAGGMGGTRISNLAPGALSAASTDAWGSQLFGLRSRVSNVAVALGGGAAYDPVKDEWIAPKYTIGGTD 

YSNVGDALAAVGGTAGAGWSLSAQGmS^APGETVDLRSGDGNIWSKAETGDTVSFDLADDLDVSE 

GGSIVIGSTMLGSNGLVITGGPSVTTDGIDAGGMKVTNVANGTVAKDSKDAVNGGQLFDWANATANGVGYDDKSKGTLTLEGANGTK 
ITNVAAGDLNANSTDAVNGSQLYATNVKVDRLDTEVKEIDSRVTYIESFQGDLENAAVYDTDAAGKRLNTLTLEGGDPDKPVLIANVA 
KGVKATDAVNVGQLDESVAESKSYTDEKTEWAIDQM 

DNEPGKLSVALGGGFWRSEGALAFGAGYTSEDGRVRANLTGAAAGGNVGVGAGLSITLN 
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SEQ ID NO: 16 (Bradorhizobium japonicum) 

MRAFGSGNAINGTNYMVGSNNWAGNNGAWGSGNGVTGDNTMFGSSIGIAGGNNMVGSFSTVTGSNSAAVGSFNNVSGNNSGAF 
GTGQNIRGNGTEMGDPNIVNGNNSLVTO^ 

GASSVAIGNGTAVTGINAIAMGTGAGANFDNSVAIGSGATTTRANQVAVGTASSTYTMSGITSMSKMQSGPTQLVTSDAAGNLATT 

SLAGLGLASAGDINGINSQLAALNGRVDNLTRES 

FTAAQQGAIGVRAGASWTLN 

SEQ ID NO: 17 (Burkholderia fungorum) 

MNKTYRSVWNESTGTWVMSEHASARGKKSSMTSSTKAW 
SDYQWGLFSNTNADGSKSGQPIGMIEGMNDGSLLLYGPNN 

TVNNITTGAGIMYFHVNSTLADSTANGVNSIAIGGATRTDANNSISIGTGLTQASSNTGAIAIGQNASINVYGANSIAIGTNSATGGI 

GGAIALGENAFATGGKMLALGSGASATTMSVALGSGSm 

DAVNVSQLQSEDAKVNTINNNVMLSGS^ 

GTNAI ALGGAVAGGS YAFAAGS LALAATTGD I ALGSSATAS S ANSNAYATALGTNALANAT DATAIGEGASATAASS VALGARSKTTA 
NLSTAGYNPGTGTLSGTTPTGEVSVGSAGKERRVTNVMGSAAT 

NITNGGGIKYFHVNSTLADSSAGGTNSIAIGGGATTGNWAGTSDNISIGTNATTNYGKNIAIGGNAQALGGAYDGGYNTAIGENAIA 
KGDGAGGFGGGGWGQTTAIGGGSQALHDNTTAVGSGAIANVMATALGMSASATAGSAIALGQGAVASAANSVALGSGSTTTXNLSAA 
GYNPGTGTLSGIASVMGEVSVGMGKERRITOTMGSMTDAVNVSQLQSEDAKVNTINNNVNNL 

FHTKSTLADSSATGTDAVAIGGNAQATAMSVALGSNSTTTMLSAAGYNPGTGALSGIASAANGEVSVGAAGKERRITNVAAGSAAT 
DAVNVSQLQSEDAKVNTISNNVNNLSG^^^ 

SSNTGAIAIGQNASINVYGANSIAIGTNSATGGIGGAIALGENAFATC^ 
TLAGTSQATNGEVSVGNAGAERRITNVMGSMTDAVNVSQLQSED^ 

FHTKSTLADSSATGTDAVAIGGNAQATAMSVALGSNSTTTMLSAAGYNPGTGTLSGTTPTGEVSVGSAGKERRVTNVAAGSAATDA 

VNVSQLQSAIIGSTAMVAYDDGTKATVTMGASGTKITNLTAGNLSATSTDAVNGSQLYATNQNVSNIGNTVNNITNGG^ 

STQADSSATGSNSVAVGDRASSLGGSSVMGDGATAVGMSIAIGNNAQNVTGSNNSVAIGGDSKAGDRSVSLGNGADTSLSSWGVAV 

GTNANVSAALGTAIGAGANVSGANSTAIGANAVASATNSVALGSNSTTTANLSAAGYNPGTGTLSGIASAMGEVSVGM 

VMGSMTDAVOTSQLQSEDAKVNTIl^NVMLSG 

DNMAIGTNATASYGKNIAIGGGAQATGGTYDGGYNVALGENANATAGTNAWGHNTAIGANTVINGVNSVALGISATTSGSGSMFGSA 

AQASADYAIASGAGANASAWSVALGSNSTTTMLSMGYNPGT^^ 

SEDAKVOTINNNVMLSNNVSNIAGNVTNISNTVNNITNGGGGIK 

NLSMGYNPGTGTLSGTTPVGEVSVGSAGKERRVTNVAAGSAATDAVNVSQLQSAIIGSTANAVAYDDGTKATVTLKGASGTKITNLT. 

AGNLSATSTDAVNGSQLYATNQWSNVGNTVSNLSN^^^ 

ATAMSVALGSNSTTTMLSMGYNPGTGALSATTPTC^ 

NVSNIAGNVTNISNTTONITNGGSGIKYFHMST^ 

SAANGEVSVGMGKERRITOTMGSMTDAVOTSQLQSEDAKVOT 

TLADSSATGTDAVAIGGNASASAANSVALGSNSTTTANLSMGYNPGSAALSGTASMNGEVSVGMGKERRITNVMGSM 
SQLQSEDAKWAEGMTAMLGGGSTYNTTTGAITSP 

STITVAKDTGGATVDFRGTGNATRTLTGITAGELSATSTDAVNGSQLYATNQNVSNIDNTVSNLSMVTNIAGNVTNISNTVNNITNG 
GGGIKYFHANSTLADSSATGVDAVAIGGNAQATAANSVALGSNSTTTANLSAAGYNPGTGTLSGIASAANGEVSVGAAGKERRVTNVA 
AGSMTDAVNVSQLQSEDMVNTIMNVNNLSNNVSNIAGN^ 

AT DAVALGNGAKATNAGAVALGAGSTTTTAVATSGTT I GGI T YT FAGVAPSSTVSVGAAG S ERT I TNVAAGRLSATSTDAVNGS ELFA 

TNQQVTMTADITNLTNNMNIGSVGLVQQDATTRTITVAKATDGTRVDFTGTGGARQLTGVMGAVMTSVDAVNGSQLYGVSQSV 

AIGGGSTVNTDGSISAPTYWDGTTVHNAGDAISNLDN^^ 

AQSQAMSVALGSNSVADRMTVSVGMGAERQITNVMGTADTDAVWAQLKASGVINTDGTTNAAVTYD 
VAGGTTIHNVAAGSMDDAVNVSQMNAAISSVSNIIGSAGNPLFTADGNRDTEMVASGTHATAMGANAKASAANSVALGANSVADRE 
NTVSVGSAGNERQVTOTMGTATTDAVOTGQLNQAIGA 
QGKTIAVGVGGGSYKGSQAVALGISARITQNLKMKAGAGTSSQGTTVGLGASYQW 

SEQ ID NO: 18 (EPEC) 

MLIQQNSEVINQLAGNTSETYIEENGASINYVRTNDTGLTFTDASAAGIGSTAVGYNTVAKGDNSVAMGYNSFAEGHSSVAIGQGSYS 
GVETSIALGSESVSSRVIVKGSRNTSVSEEGWIGYDTTDGELLGALSIGDDGKYRQIINVADGSEAHDAVTVRQLQNAIGAVATTPT 
KYYHMSTAEDSLAVGEDSLMGAKTIVNGMGIGIG^ 

QNSVGEFSVGSEDGQRQITNVAAGSADTDAVNVGQLKVTDAQVSQNTQSITNLNTQVTNLDTRVTNIENGIGDIVTTGSTKYFKTNTD 
GVDANAQGKDSVAIGSGSIAAADNSVALGTGSVANEENTISVGSSTNQRRITNVAAGVNATDAVNVSQLKSSEAGGVRYDTKADGSID 
YSNITLGGGNGGTTRISNVSAGVNNNDAVNYAQLKQSVQETKQYTDQRMVEMDNKLSKTESKLSGGIASAMAMTGLPQAYTPGASMAS 
IGGGTYNGESAVALGVSMVSANGRWVYKLQGSTNSQGEYSAALGAGIQW 



-4- 



1 t 

WO 2004/113371 PCT/TO2004/002351 

SEQ ID NO: 19 

GSGGGG 

SEQ ID NO: 20 (Haemophilus aegyptius) 

GTACCGCACGAGCTGGCAAAAAAGGCACGGCAGTCTCTTTTGTCGM 
TGAGGAMTTTTAAAGGC&CGCATTTTAGCAGGTTTAGMCCT^ 

AAACAAAAGGCGCGCATTAAAGAAAAACGTGMGATMGAAAAAMCAGAGGCAMGAAAAAAGTAAMTTGCGTCATM 
AAMTATCGGCAMCGACGCMGCCMGTAATAGTAATAn^ 

AATTTTTCTATATTGCTTTMCGTTTTTTTATAGTAGAAGTAT ATACTCAG TTATGGTTATGGTTACATAG TATAGT TTTACTTTGTT 
CTAGTTCACTTTAATAACCTTAAATAATT GAGGA TTTCTTATGAAAAGAAATTTATTAAAACAATCTGTAATCGCTGTGTTGATAGGT 
GGCACTACTGTTTCTMTTATGCTTTAGCACMGCACMGCACMGCACAAG 

TAAAGGAAATGGATGCTGCTATCGATGGTATTCTTGATGATAATATTGCTTATGAAGCTGAAGTTGATGCAAAACTTGATCAGCATTC 
TGCTGCTCTTGGTAGACATACAAATAGACTCAATAATCTTAAAACGATTGCAGAGAAAGCAAAAGGTGATTCAAGTGAAGCACTTGAT 
AAAATTGAAGCTCTTGAAGAACAAAATGATGAGTTTTTAGCGGATATTACAGCTTTAGAAGAGGGAGTTGATGGTTTAGATGATGATA 
TCGCAGGTATTCAAGATAATATTTCTGATATAGAAGATGATATTAATCAAAATTCTGCAGACATCGCAACTAACACAGCGGCAATCGC 
AACTCACACTCAACGTCTTGATAATTTAGATAACAGAGTAAATAACCTTAATAAAGATCTTAAACGTGGTCTTGCTGCTCAAGCTGCA 
TTAAATGGTTTATTCCAACCGTATAACGTAGGTAAATTAAATCTTACTGCTGCTGTAGGTGGTTATAAATCTCAAACTGCAGTTGCTG 
TAGGTAC 

SEQ ID NO: 21 (Haemophilus somnus) 

ATGAAAAMGTACAATTTTTTAAATATTCATCATTGGCATTAGCATTGGGTTTAGGGGTMGTGCTTCTGCTTTGGCAGCCCCMCM 
GTACAAGTACGACTACTGGACCAGAGGCGCCTCCTACAGGCCCTGCTCCTACGGCGAAAGACCCTCTAGCAGAAACAGCGTTAGCCTA 
TGATTTGGAGAACGAAGTTGCGTATCTTCGTATGAAGGCGGGTGAGTGGATGCAATTGGGGCTTGATCCTGAAAAAGAAGTCATCAAA 
GGCTGGAATGAGGTAAMTCTCTCCCTCGTATCGATGGAMTGGAMGGATAAACAGACAAMGATCAAATAGCAATGTTGATAAGAA 
CGGTTGATMTACAAMGAGCTTGGTCGGATCGTTAGTACAAACATTGAAGATATTMGAACCTTAAAAAAGAGCTTTACGGTTTTGT 
AGAAGATGTGAACGAGAGTGAAGCACGCAATATCTCAAGAATAGATGAGAATGAGAAAGATATTAAGAACCTTAAAAAAGAGCTTTAC 
GATTTTGTAGAAGATGTGAACGAGAGTGAAGCACGCAATATCTCAAGAATAGATGAAAATGAGAAGGACATTAATACTCTTAAAGAGC 
TAATGGATGAGGATTTAAATTCAGTCTTAACCCAAATTGAAGATGTAAAACTCACATTTCAAGATGTCAATGATAACGTTAATTTGGC 
ATTTGAAGAGATTAATGGAAATGCCCAAAAGTTTGACACTGCTATTGAAGGACTTACTTCAGGTTTGAGCGATTTACAAGCTAAAGTC 
GATGCAAATAAACAAGAAACTGAAGACGATATTGCGGACAATGCCAAGGCTATTCATAGCAACACAAAAGGTATTGCTAAAAATACCA 
AGGATATTCGTGACTTGGACACCAAAACCAAGCAAATGTTGGAAAATGACAAAAACTTGATGACCGGTTTAGAATCTTTAGCAACAGA 
AACAAGCAAAGGCTTTGAAAGATTTGATGTCAAAACACAACAATTAGATCAAGCCGTCGCAAATGTCGTCGGTCGAGTAGACATAACT 
GAGCAAGCTATTCGCCAAAACACTGCAGGCTTAGTCAATGTGAATAAACGTGTCGATACACTCGACAAAMCACCAAAGCCGGTATCG 
CTTCTGCAGTCGCTTTAGGTATGTTGCCACAATCCACTGCTCCGGGTAAATCATTAGTGAGCTTAGGTGTCGGTCATCACCGTGGGCA 
AAGTGCTACTGCTATTGGAGTATCTTCTATGAGCAGTAACGGTAAATGGGTTGTTAAAGGCGGTATGAGCTATGATACACAGCGTCAT 
GCTACTTTCGGCGGTTCTGTCGGTTTTTTCTTTAACTAA 

SEQ ID NO: 22 (Escherichia coli) 

ATGAAAACTGTAAACGTAGCTTTACTGGCACTCATAATTTCAGCAACATCCAGCCCTGTTGTTTTAGCTGGTGATACCATTGAAGCGG 
CGGCAACAGAGCTTTCAGCCATTAACTCTGGCATGTCGCAATCGGAGATTGAGCAGAAGATTACCCGCTTTTTAGAACGCACAGACAA 
CAGCCCCGCTGCGTATACCTATTTGACTGAACATCACTACATCCCTTCTGAAACACCTGATACCACTCAGACTCCCACTGTCCAGACA 
GATCCTGACGCAGGACAAAAAACCGTTGCCGCTACAGGTGATGTACAGACAACTGCCCGTTATCAGAGCATGATCAACGCCCGACAGT 
CTGCGGTAACTGACGCCCAGCAAACGCAAATTACAGAGCAACAGGCGCAGATCGTAGCCACACAAAAAACGCTCGCCGCGACTGGAGA 
TACGCAAAATACCGCGCATTATCAGGAMTGATTMTGCCAGACTGGCGGCTCAAAATGAGGCTMTCAGCGCACCGCCACTGMCAA 
GGGCAGAAMTGMTGCGCTGACAACCGATGTGGCAGTACAACAGCAAMTGAAAGGACTCMTACGATAAACAAATGCAAAGTCTGG 
CGCAGGAGTCTGCCCAGGCACATGAACAAATTGACAGCCTGTCACAAGACGTAACCCAAACGCACCAACAGTTAACCAACACCCAAAA 
ACGGGTTGCAGATAACAGCCAGCAAATTAACACGCTCAATAACCATTTCAGTTCGCTAAAAAACGAAGTTGATGACAATCGTAAAGAA 
GCCAATGCGGGAACTGCATCTGCCATCGCTATCGCCTCACAACCACAGGTTAAAACCGGTGACGTGATGATGGTGTCAGCGGGAGCGG 
GAACCTTCAACGGTGAATCTGCGGTGTCTGTCGGAACATCATTTAATGCCGGAACGCATACGGTACTTAAAGCCGGTATTTCTGCGGA 
TACACAATCTGATTTCGGCGCAGGTGTCGGCGTGGGATATTCGTTCTAA 

5 SEQ ID NO: 23 (Escherichia coli) 

ATCGCCAAACAGCGTCGGCGTCTGGGCGCAGTAAGAGACTTGCTGACGGTAGATTTCTGGCTTTAGTGTGCTGACATCCTCACCTTCA 
AACAGTAACGTTCCGCTGGTTGGGCTGATCAATGAAGCAACTATTTTTAGCAGCGTACTTTTGCCACAACCAGAAGGACCGGTAATTA 
ACTTAAATTCGCCAGCACGCAGCGAAAAATTGATGTTATTAAGAATCTTCGCATCACCCGCCAGATATCCTACGTTTTGTAGCTGAAG 
CAAAGGACTATTTTCCTGCATCGCTGTTCCCTTTTTCTGATTTTTACTAAAAACAGTTTATCCTTCGCAGGAATAAGGGGGAACTCTC 
TTTCAGTAATCAGGTATAATTTGTTTAATCAAGCAGCAATTTTGCTGATTATGGACGGTACAACAGGAGGTTTTTCCGATGCTTATCT 
TTATCCCGATTCTCATTTTTGTCGCGCTGGTCATTGTCGGCGCGGGCGTCAAAATCGTGCCGCAGGGCTATCAATGGACGGTAGAACG 
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TTTTGGTCGCTATACCAAAACGTTACAGCCGGGGCTCAGTCTGGTGGTGCCGTTTATGGATCGCATTGGTCGCAAGATCMTATGATG 
GAGCAAGTGCTCGATATCCCTTCCCAGGAAGTTATCTCGAAAGATAACGCCAACGTTACCATCGACGCAGTCTGTTTTATTCAGGTGA 
TTGACGCGCCACGCGCGGCTTATGAAGTCAGCAATCTGGAGCTGGCGATCATCAACCTGACCATGACTAACATCCGTACCGTGTTGGG 
TTCMTGGAACTTGACGAAATGCTCTCTCAGCGCGACAGCATCAACTCACGCCTGCTGCGTATTGTCGATGAGGCCACCAACCCGTGG 
GGGATTAAAGTCACCCGTATTGAMTTCGCGACGTGCGCCCACCGGCAGAGCTTATCTCTTCMTGAACGCGCAGATGAMGCGGMC 
GTACCAMCGCGCTTACATTCTTGMGCGGMGGGATCCGTCAGGCGGAAATCCTCAMGCCGMGGTGAAAAACAGTCGCAAATCCT 
GAAAGCGGAAGGCGAACGTCAGTCGGCGTTTTTACAGGCTGAAGCGCGTGAACGTTCCGCTGAAGCAGAAGCCCGCGCCACCAAMTG 
GTGTCTGAAGCCATCGCCTCCGGTGATATTCAGGCGGTGAACTACTTCGTAGCGCAGAAATACACCGAAGCGTTACAGCAGATCGGTT 
CCTCCAGTAACAGCAAAGTAGTGATGATGCCATTAGAGGCCAGCAGCCTGATGGGGTCGATTGCCGGGATTGCCGAGCTGGTGAAAGA 
CAGCGCCAACAAG CGGA CTCAGCCATGATGGAGTTAATAGTCGTTCATCCACATATTTTCTGGCTCAGTCTCGGCGGTTTGCTGCTGG 
CAGCCGAGATGCTGGGCGGAAATGGTTATTTGTTGTGGAGTGGCGTGGCAGCAGTGATTACTGGCCTGGTGGTCTGGCTGGTGCCGCT 
GGGTTGGGAGTGGCAAGGGGTGATGTTTGCCGTCCTGACGCTGCTCGCCGCCTGGCTGTGGTGGAAATGGTTGTCGCGGCGGGTGCGC 
GMCAAMGCACAGCGACAGTCATTTAAACCAGCGCGGGCAGCAGCTGATTGGCCGACGTTTTGTGCTGGAATCTCCGCTGGTCAACG 
GGCGCGGTCATATGCGCGTCGGTGACAGTTCATGGCCTGTCAGCGCCAGCGAGGATCTCGGCGCAGGTACGCATGTTGAAGTCATTGC 
GATAGMGGGATMCGCTGATCATCCGTGCGGTCATra^ 

TCGTAGGCCGGATAAGGCGTTTACGCCGCATCCGGCAGTCGTGCACCGACGCCTGATGCGACGCGGGCGCGTC^ 

CGIMGCCGCCTCCGCCATGTT^ 

GGATAGTGTmTTTAGATATTCCAGGACGTTlT 

GGTAAACTTCTGGCGGAATGGTGAAATC&GAAAGCGTTAACCM 

CATAATCAGCG&CATGGCAGAGGCAGGAGCCGTACTATCGC^ 

GGTAGCAATGCCCGGTCCM&TTCATTTCATTAAAAATC^ 

AAAITCGAGATTTATCAAAGAAACGCATTTTAGCAra 

AAAACAAGTCTTXACTCmTTTACATTGAAAGAGCACGAAM 

GATGGTGTTTACGCTTACAACAGACAAAAATGCGCTTTACATCACAOVA^ 

TTTCTTAAAACAA2MTATTTGTTTCTTATAG 

GCTTTACTGGCACTCMAATTTCAGCMCATCCAGC^ 

CCATTMCTCTGGCATGTCGCAATCGGAGATTGAGCAGAAGATTACCCGCTTTTTAGAACGCAC^ 

CTATTTGACTGAACATCACTACATCCCTTCTGAAACACCTGATACCACTCAGACTCCCA 

AAAACCGTTGCCGCTACAGGTGATGTACAGACAACTGCC^^ 

AGCAAACGCAAATTACAGAGCAACAGGCGCAGATCGTAGCCACACAAAAAACGC 

TTATCAGGAAATGATTAATGCCAGACTGGCGGCTCAAAATGAGGCTAATCAGCGCACCGCC^ 

CTGACAACCGATGTGGCAGTACAACAGCAAAATGAAAGGAC 

CACATGAACAAATTGACAGCCTGTC^CAAGACGTMCCCAAACGCACCAAC^ 

CCAGCAAATTAACACGCTCAATMCCATTTCAGTTCGCTAAAAAACGAAGTTGATGACAATCGT 

TCTGCCATCGCTATCGCCTCACAACCACAGGTTAAAACCGGTGACGTGATGATGGTGTCAGCGGGATC 

CTGCGGTGTCTGTCGGAACATCATTTAATGCCGGAACGCATACGGTACTTAMGCCGGTATTTC 

CGCAGGTGTCGGCGTGGGATATTCGTTCTAATATTTCAAT^ 

TAATTTGTACATTATTTAAGGTTAACAATGATGAATAGCTCCATTAAATCGTTTTCCCTGCTGGCGGTTATATTACTGGCTGGCTGTA 

GTTCACCCACTTCCCGC&TCGCAGATT^ 

TATTTTAAGTGCATCCGAGGCACAGGCATTTAAAA^ 

TTTGGCATGACCTTTAGAATGAGC&GTAAAAACTTTGCT^^ 

CTGTTTATCAGTC&GGTCTATATAACGTC&TTGTTTA^ 

TTTAAAATGAAGGAGCAAAGGAAAATACCCCI^ 

TAGCCTCCCTGGTTCACGCTGTC^ 

GCATATTCTTTCGGTGCAAACGGGGATTCACTGATTTTC^CCCCGCCCGATGATGACAGCAGCCGGAGAGATTTTCGATMTC 

GTCGGCGCTGTCATCACCGGGGCAGGCATTCGCCAGCGCCAGTAGCTGGTTGCGCATAGATTGCAGTTCTTCGATATGCCGTTCAATC 

TCCGCCACCTTCTCCAGCGTGCGACGTTTGACGTCGGCACTGTGACGCTGCGGGTCGTTAAACAGATTCACCAGCTCGCCGCTCTCTT 

CCAGGTTAAAGCCCACCTGGCGCGCCTGGCGCAGTAAGGTCAATTCGTTGAGATGCTGCTGCGTGTAGGTTCGATAACCATTTTCGCT 

GCGCATCGGCGGCGTCACCAGCCCCTTCTCTTCATAGAAGCGAATGGCTTTGCTGGTCAGGCCGGTAATTTTTGCTACATCGCTAATG 

TTCATCGTTCGCGCAACGCC 

SEQ ID NO: 24 (EPEC) 

ATGAAAACTGTAAACGTAGCTTTACTGGCACTCATAATTTCAGCAACATCCAGCCCTTTTGTTTTAGCTGGTGATACCATTGAAGCGG 
CGGCAACAGAGCTTTCAGCCATTAACTCGGGCATGTCGCAATCGGAGATTGAGCAGAAGATTACCCGCTTTTTAGAACGCACCGACAA 
CAGCCCCGCTGCATATACCTATTTGACTGAACATCACTACATCCCTTCTGAAACACCTGATACCACTCAGACTCCCCCTGTCCAGACA 
GATCCTGACGCAGGACAAAAAACCGTTGCCGCTACAGGTGATGTACAGACAACCGCCCGTTATCAGAGCATGATCAACGCCCGACAGT 
CTACGGTAACTGATGCCCAGCAAACGCAAATTACAGAGCAACAGGCGCAGATCGTAGCCACACAAAAAACGCTCGCCGCGACTGGAGA 
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TACGCAAMTACCGCGCATTATCAGGAGATGATTMTGCCAGGCT 

GGGCAGAAAATGAATGCACTGACAACCGATGTGGCAGCACAACAGCAAAAAGAAAGGGCTCAATACGATAAACAAATGCAAAGTCTGG 
CGCAGMGTCTGTCCAGGCACATGAGCAMTTGAMGTCTGAGACMG^ 

ACGGGTTGCAGATMCAGCCAGCAMTTMCACGCTCMTMC^TTTCAGTTCGCTAAAAMCGAAGTTGAGGACMTCGTAMGM 
GCCMTGCGGGAACTGCATCTGCCATCGCTATCGCCTCACMCCACAGGTGAAAACCGGTGACTTGATGATGGTCTCAGCGGGAGCGG 
GAACCTTTAACGGTGAATCTGCGGTGTCTGTCGGAACATCTTTTAATGCCGGAACGCATACGGTACTTAAAGCAGGTATTTCTGCGGA 
TACACAATCTGATTTCGGTGCGGGTGTCGGCGTGGGATATTCGTTCTAA 

SEQ ID NO: 25 (EAEC) 

ATGAAAACTGTAAAGCTGTCTTTACTGGCTGTCGTTGTTGCTACCGCGGTAAGTCCATCTGCGTTTGCGGGTGATACTGTTGAGGCGG 
CAACGACAGAATTAACGGTAATCCAGCCAGGAATGTCGCAATCGGAAATTGATCAGAAAATTGGTCGATTTTTAGAAAGGACAGGGAA 
TAGTGTAGCCGCACAAAATTATCTGATTGCGCATGATTACCAGACAACGACGCCTCAGGAAAATACAGCTGCTTCTCCCGTACAGCCC 
ACCAATACGCTGAACCCGATAACCAATCMGCGCAGACCGACCGCGACAACGGGCAGGATACCGCCATTCAGGACGCGCAGCACGCCG 
CCAACTGGGCTTCACTGAAAGCTGATGACGCGCAGCACGCCATCACGGTGGCGCAGACGGATATTGATGCCMTACAGCCGCCATCAC 

CGATACCCGTMTGATGTCTCCGCAGTGCAGTC^ 

GCCAACGCTAACGCCAACACCGCTCTGATTAACGGCGTCAAACTTTCCGGTGCTGTGACAGAAAACAAAAACAACATCGAACAGAACC 
GCAGCGATATTGCTGACCAGCAG2\AACTGTTGGCATCAAACGAGCAAAAACAGATCGTCCGCGACAACGGGCAGGATACCGCCATTCA 
GGACGCACAACATGCCGCCAACTGGGCTTCACTGAAAGCTGATGACGCGCAACACGCCATCACGGTGGCGCAGACGGATATTGATGCC 
AATAAAGCCGCCATCACCGACATCCGTAATGATGTCTCCGCAGTGCAGTCAGACGTCACCAACATAAAAGGCGATGTCGCACATGCCC 
AGTCAACGGCTGACCATGCCMCGCTMCGCCMCACCGCTCTGATGAACGGCGTCAAACTCTCCTCTGCTGTGACAGAAMC7^AAM 
TAATATCGAACAGMCCGCAGCGATATTGCTGACCAGCAGAAACTGTTGGCATCAAACGAGCAAAAACAGATCGTCCGCGACAACGGG 
CAGGATACCGCCATTCAGGACGCACAACATGCCGCCAACTGGGCTTCAATGAAAGCTGATGACGCGCAGCACGCCATCACGGTGGCGC 
AGACGGATATTGATGCCAATAAAGCCGCCATCGCCGACACCCGTAATGATGTCTCCGCAGTGCAGTCAGACGTCACCAACATAAAAGG 
CGATGTCGCACATGCCCAGTCAACGGCTGACCATGCCAACGCTAACGCCAACACCGCTCTGATTAACGGCGTCAAACTTTCCGGTGCT 
GTGACAGAAAACAAAAATAATATCGAACAGAACCGCAGCGATATTGCTGACCAACAGCAACAACTCGACGAAACCCGGAAAATCGTTG 
CCGCCACCGGGGATGTGCAGACTGCCGCCCGATATCAGAGCATGATTGACGCCAGACAGACAGCCGCTGCCAATGCGCAACAGGCACA 
AGCTGACACCCAGCAGCAACAAATGGACGATCAGCAGAAACAAATCGACGCGACGCAAAAAACGGTTTCCGCACTTGGCGATGCCCAG 
ACCMCGC^CATTATCMGAGATGGTTMCGCCGGACTGAGAGCACAAMTGATGCGAATGCGCGTACTGCAGCAGAACAAAAACAAA 
AAATAGATACTCTGGCGACTAACCAGGCAACGCAACAGCATATCAATAGTGTGCAGTACGGGGAACAAATTCAGCGTCTGGCGCAAGA 
CTCAACACAAACGCATGAACAAATTGACAGCCTGACACAAGACGTAACCCAAACGCATCAGCAGTTAAGCAACACGCAAAAACGAGTA 
GCGGATMTAGCCAGCAGATTACTACGCTCAATAACCATTTCAGTTCGCTGAAAAACGAAGTTGAGGACAACCGTAAAGAAGCCAATG 
CGGGAACTGCATCAGCCATCGCTATCGCCTCACAACCACAGGTGAAAGCCGGTGACTTTATGATGATGTCAGCGGGAGCGGGAACCTT 
CAACGGTGAATCTGCGGTGTCTGTCGGAACATCTTTTAATGCCGGAACGCATACCGTGATTAAAGCCGGTGTCTCTGCGGATACGCAA 

TCTGATTTCGGCGCGGGTGTCGGCGTGGGATATTCTTTCTAA 

SEQ ID NO: 26 (UPEC) 

ATGMCAAMTATTTAMGTTATCTGGAATCCGGCMCAGGCAGTTACACCGTTGCCAGCGAAACGGCGMGAGCCGTGGTAAAAAM 
GCGGGCGCAGTAAGCTGTTAATTTCTGCACTGGTTGCGGGTGGGTTGTTGTCGTCGTTTGGGGCAAGTGCAGATAATTACACTGGGCA 
GCCAACTGATTATGGCGATGGCTCAGCAGGTGACGGCTGGGTTGCTATCGGTAAAGGGGCAAAAGCAAATACCTTTATGAACACTAGT 
GGCGCGAGTACAGCTTTAGGATATGACGCGATAGCCGAAGGTGAGTACAGTTCTGCCATCGGGTCAAAAACCCTTGCAACTGGTGGAG 
CATCCATGGCGTTCGGGGTTAGTGCAAAAGCAATGGGTGACAGAAGTGTCGCGCTAGGTGCATCGTCAGTAGCAAATGGCGATCGTTC 
GATGGCTTTTGGTCGTTACGCAAAGACGAATGGTTTTACATCTCTTGCTATTGGGGACTCCTCCCTTGCCGATGGTGAAAAAACTATT 
GCGTTAGGAMTACGGCTAMGCTTACGAMTTATGA^^^ 

TGGGAGCAAGTAGCAAAGCTGGCGGTGCTGATAGCCTCGCATTCGGCAGAAAATCTACAGCTAATAGCACTGGCTCACTGGCAATAGG 
TGCTGACAGTAGCAGTTCGAACGATAACGCCATCGCGATAGGGAACAAAACGCAAGCCCTGGGAGTGAATTCGATGGCCCTGGGTAAT 
GCAAGTCAGGCATCTGGCGAATCCAGTATTGCATTAGGTAACACCAGTGAAGCCAGCGAACAAAATGCGATTGCGCTGGGGCAAGGTA 
GCATTGCAAGCAAAGTGAACTCAATCGCGTTGGGAAGTAACAGTTTGTCCTCGGGAGAGAATGCCATCGCATTGGGAGAGGGTAGTGC 
CGCTGGTGGCAGCAACAGCCTTGCTTTCGGTAGCCAGTCCAGGGCAAACGGCAATGATTCTGTCGCCATCGGTGTAGGGGCTGCAGCA 
GCGACCGACMTTCTGTCGCTATCGGCGCAGGATCGACCACAGATGCMGCMTACGGTTTCAGTTGGCMCAGCGCAACAAAACGCA 
AAATTGTTAATATGGCTGCTGGTGCCATAAGCAACACCAGTACCGATGCCATCAACGGCTCACAGCTTTATACGATCAGTGATTCAGT 
CGCCAAGCGACTCGGAGGAGGCGCTACTGTAGGCAGCGATGGCACCGTAACCGCAGTAAGCTACGCGTTGAGAAGCGGAACCTATAAT 
AACGTGGGTGATGCTCTGTCAGGAATCGACAATAATACCCTACAATGGAATAAAACCGCGGGGGCGTTCAGCGCCAATCACGGTGCAA 
ATGCCACCAACAAAATCACTAATGTTGCTAAAGGTACGGTTTCTGCAACCAGCACCGATGTAGTAAACGGCTCTCAATTGTACGACCT 
GCAGCAGGATGCTCTGTTGTGGAACGGCACAGCATTCAGTGCCGCACACGGCACCGAAGCCACCAGCAAAATCACTAACGTCACCGCT 
GGCMCCTGACTGCCGGCAGCACTGACGCCGTTMCGGCTCTCAGCTCAAMCCACCMCGACMCGTGACGACCAACACCACCAACA 
TCGCCACTAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAGCAGCTGG 
CGCATTCAGCGCCGCGCACGGCACCGAAGCCACCAGCAAMTCACCAACGTCACCGCTGGCAACCTGACTGCCGGTAGCACTGACGCC 
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GTTMCGGCTCCCAGCTCAAMCCACCMCGACMCGTGACGACCMCACCACCAACATCGCCACTMCACCACCMTATCACCAACC 
TGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCGCCGCGCACGGCACTGACGC 
CACCAGCAAGATCACCAACGTCACCGCTGGCMCCTGACTGCCGGCAGCACTGACGCCGTTAACGGCTCCCAGCTCAAAACCACCMC 
GACAACGTGACGACCAACACCACCAACATCGCCACTAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTCGGTGACG 
ACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCGCCGCGCACGGCACTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGG 
TGACCTGACAGCTGGCAGCACTGACGCCGTTAACGGCTCTCAGCTCAAAACCACCAACGATAACGTGTCGACCAACACCACCAACATC 
ACCAACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCGCCGCTCACGGCA 
CTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGGTGACCTGACAGCTGGCAGCACTGACGCCGTTAACGGCTCCCAGCTCAAAAC 
CACCAACGATAACGTGTCGACCAACACCACCAACATCACTAACCTGACGGATTCCGTTGGCGACCTTAAGGACGATTCTCTGCTGTGG 
MCAMGCGGCTGGCGCATTCAGCGCCGCGCACGGTACCGMGCTACCAGCAAGATCACCMCTTACTGGCTGGCAAGATATCTTCTA 
ACAGCACTGATGCCATTMTGGCTCACMCTTTATGGCGTAGCGGATTCATTTACGTCATATCTTGGTGGTGGTGCTGATATCAGCGA 
TACGGGTGTATTAAGTGGGCCAACCTACACTATTGGTGGTACTGACTACACTAACGTCGGTGATGCTCTGGCAGCCATTAACACATCA 
TTTAGC7VCATCACTCGGCGACGCCCTACTTTGGGATGCAACCGCAGGCAAATTCAGCGCCAAACACGGCATTAATAATGCTCCCAGTG 
TAATCACTGATGTTGCAAACGGTGCAGTCTCGTCCACCAGCAGCGACGCCATTAACGGTTCACAACTTTATGGTGTTAGTGACTACAT 
TGCCGATGCTCTGGGCGGGAATGCTGTGGTGAACACTGACGGCAGTATCACTACACCAACTTATGCCATCGCTGGCGGCAGTTACAAC 
MCGTCGGTGACGCGCTGGAAGCGATCGATACCACGCTGGATGATGCTCTGCTGTGGGATACAACAGCCAATGGCGGTMCGGTGCAT 

TTAGCGCCGCTCACGKjGAMGATAAMCTGCCAGTGTMTC^^ 

TAATGGCTCACAGCTCTATAGCACTAATAAGTACATCGCTGATGCGCTGGGTGGTGATGCAGAAGTCAACGCTGACGGTACTATCACT 

GCACCGACTTACACCATTGCAAATACCGATTACAACAACGTCGGTGAAGCCCTGGATGCGCTCGATAATAACGCGCTGCTGTGGGATG 

AAGACGCAGGTGCCTACAACGCCAGCCATGATGGCAATGCCAGCAAAATCACCAACGTTGCGGCTGGTGATCTCTCCACAACCAGTAC 

CGATGCTGTTAACGGTTCCCAGTTAAACGCAACCAATATTCTGGTTACGCAAAATAGCCAAATGATTAACCAGCTTGCTGGTAACACT 

AGCGAAACCTACATCGAGGAAAACGGTGCGGGTATTAACTATGTACGTACCAACGACAGCGGCTTAGCGTTCAACGATGCCAGCGCTT 

CAGGTATTGGCGCTACAGCTGTAGGTTATAACGCAGTTGCCTCTCATGCCAGCAGTGTAGCGATCGGTCAGGACAGCATCAGCGAAGT 

TGATACGGGTATCGCTCTGGGTAGCAGTTCCGTTTCCAGCCGTGTAATAGTTAAAGGGACTCGTAACACCAGCGTATCGGAAGAAGGT 

GTTGTGATTGGTTATGACACCACGGATGGCGAACTGCTTGGCGCGTTGTCGATTGGTGATGACGGTAAATATCGTCAAATCATCAACG 

TCGCGGATGGTTCTGAAGCCCATGATGCGGTCACTGTTCGCCAGTTGCAAAACGCCATTGGTGCAGTCGCAACCACACCAACCAAATA 

CTATCACGCCAACTCAACGGCTGAAGACTCACTGGCAGTCGGTGAAGACTCGCTGGCAATGGGCGCGAAAACCATCGTTAATGGTAAT 

GCGGGTATTGGTATCGGCCTGAACACGCTGGTTCTGGCTGATGCGATCAACGGTATTGCTATCGGTTCTAACGCACGCGCAAATCATG 

CCGACAGCATTGCAATGGGTAATGGTTCTCAGACTACCCGTGGTGCGCAGACCAACTACACTGCCTACAACATGGATGCACCGCAGAA 

CTCTGTGGGTGAGTTCTCTGTCGGCAGTGMGACGGTCMCGTCAGATCACCAACGTCGCAGCAGGTTCGGCGGATACCGATGCGGT 

MCGTGGGTCAGTTGAAAGTMCGGACGCGCAGGTTTCCCAGMTACCCAGAGCATTACTMCCTGAACACTCAGGTCACTAATCTGG 

ATACTCGCGTGACCAATATCGAAAACGGCATTGGCGATATCGTAACCACCGGTAGCACTAAGTACTTCAAGACCAACACCGATGGCGC 

AGATGCCAACGCGCAGGGTAAAGACAGTGTTGCGATTGGTTCTGGTTCCATTGCTGCCGCTGACAACAGCGTCGCACTGGGCACGGGT 

TCCGTAGCAGACGAAGAAAACACCATCTCTGTGGGTTCTTCTACCAACCAGCGTCGTATCACCAACGTTGCTGCCGGTGTTAATGCCA 

CCGATGCGGTTAACGTTTCGCAACTGAAGTCTTCTGAAGCAGGCGGCGTTCGCTACGACACCAAAGCTGATGGCTCTATCGACTACAG 

CAACATCACTCTCGGTGGCGGCAATAGCGGTACGACTCGCATCAGCAACGTTTCTGCTGGCGTGAACAACAACGACGCAGTGAACTAT 

GCGCAGTTGAAGCAAAGTGTGCAGGAAACGAAGCAATACACCGATCAGCGCATGGTTGAGATGGATAACAAACTGTCCAAAACTGAAA 

GCAAGCTGAGTGGTGGTATCGCTTCTGCAATGGCAATGACCGGTCTGCCGCAGGCTTACACGCCGGGTGCCAGCATGGCCTCTATTGG 

TGGCGGTACTTACAACGGTGAATCGGCTGTTGCTTTAGGTGTGTCGATGGTGAGCGCCAATGGTCGTTGGGTCTACAAATTACAAGGT 

AGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGTGCCGGTATTCAGTGGTAA 

SEQIDNO:27(EHEC) 

ATGAACAAAATATTTAAAGTTATCTGGAACCCTGCGACAGGGAATTATACTGTTACCAGCGAAACGGCAAAAAGCCGTGGCAAGAAAT 
CTGGGCGCAGTAAGCTGTTAATTTCTGCGCTGGTTGCGGGTGGAATGTTGTCGTCGTTTGGGGCATTGGCGAATGCCGGGAATGACAA 
CGGTCAGGGTGTTGATTACGGTAGTGGATCAGCTGGCGACGGCTGGGTTGCTATAGGCAAAGGGGCGAAAGCAAATACTTTTATGAAC 
ACCAGTGGTTCCAGTACTGCTGTGGGTTATGACGCTATAGCTGAAGGCCAATATAGCTCTGCCATCGGGTCAAAAACCCATGCGATTG 
GTGGTGCATCAATGGCCTTTGGGGTTAGTGCAATATCAGAAGGCGATAGAAGTATAGCACTGGGTGCCTCTTCGTATTCATTGGGCCA 
ATACTCAATGGCCCTCGGCCGTTATTCAAAAGCATTGGGTAAATTGTCTATTGCTATGGGGGACTCTTCCAAAGCGGAAGGAGCAAAC 
GCCATTGCCCTGGGAAATGCCACTAAAGCTACTGAGATTATGAGTATTGCTCTTGGCGACACCGCCAATGCGTCAAAAGCGTATTCAA 
TGGCGCTGGGAGCAAGTAGCGTCGCATCTGAAGAAAACGCTATTGCGATAGGTGCTGAGACCGAAGCCGCTGAAAATGCAACTGCTAT 
TGGCAATAATGCGAAGGCAAAAGGGACTAATAGCATGGCAATGGGGTTCGGAAGCCTTGCCGATAAAGTCAATACTATCGCATTAGGA 
AATGGCAGCCAGGCTCTGGCAGATAATGCAATCGCCATAGGCCAGGGCAACAAAGCTGATGGCGTGGATGCCATCGCTCTGGGTAATG 
GTAGCCAGTCGAGAGGCTTAAACACCATTGCCTTAGGCACAGCCAGTAATGCAACTGGTGATAAGAGTCTTGCGCTTGGTAGTAATAG 
CAGTGCCAACGGTATTAACTCTGTCGCGCTGGGCGCAGATTCCATTGCGGATTTAGACAATACCGTCTCTGTCGGCAATAGTTCATTA 
AAACGCAAGATCGTTAATGTGAAAAATGGCGCGATCAAGTCTGACAGTTACGATGCCATTAATGGTTCACAGCTTTATGCCATTAGCG 
ACTCGGTAGCAAAAAGGCTTGGAGGAGGGGCTGCAGTAGATGTTGATGACGGTACTGTTACAGCACCAACCTACAATTTAAAAAATGG 
TAGCAAAAATAACGTAGGGGCTGCGCTCGCTGTACTTGATGAAAACACCCTGCAATGGGACCAAACCAAAGGCAAATACAGCGCTGCT 
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CATGGTACTAGTAGCCCMCTGCCAGCGTAATCACCGATGTTGCGGATGGCACGATTTCAGCCTCCAGTAAGGATGCGGTTAACGGTT 

CCCAACTGAAAGCTACCAATGACGATGTCGAAGCCAACACCGCCAATATCGCTACTAATACCAGCAACATTGCCACGAATACGGCAAA 

TATTGCCACCMTACCACCMTATCACCMCCTGACGGATTCCGTTGGTGACCTTCAGGCTGATGCCCTGCTCTGGAACGAAACTAAA 

AAGGCATTCAGTGCAGCTCACGGCCAGGATACCACCAGCAAAATCACCAACGTTAAAGATGCCGACCTGACGGCTGACAGCACTGATG 

CTGTTAACGGCTCTCAGCTGAAAACCACCAACGATGCTGTGGCGACGAATACCACCAATATCGCCAATAACACTTCCAATATTGCCAC 

TAACACCACCAACATCTCTAACCTGACTGAGACGGTGACTAATCTTGGTGAGGATGCGCTGAAATGGGATAAGGACAATGGTGTATTC 

ACGGCAGCTCATGGCACCGAGACCACCAGCAAAATCACCAACGTTAAAGATGGCGACCTGACKACTGGCAGCACCGATGCCGTTAACG 

GCTCTCAGCTGAAAACCACCAACGATGCCGTGGCGACGAATACCACCAATATCGCCACTAACACCACCAACATCTCTAATCTGACTGA 

GACGGTGACTAATCTTGGTGAGGATGCGCTGAAATGGGATAAGGACAATGGTGTCTTCACTGCAGCTCATGGCAACAATACCGCCAGC 

AAMTCACCMTATCCTGGACGGCAGVGTCACTGCAACCAGTTCCGATGCCATTAACGGTAGCCAGCTTTATGACTTAAGCAGCAATA 

TCGCCACCTACTTCGGCGGCAATGCTTCTGTGAATACTGACGGTGTGTTTACCGGTCCAACCTACAAAATCGGTGAAACAAATTATTA 

TAACGTCGGCGATGCACTGGCTGCGATTAACTCCTCATTTAGCACGTCTCTCGGCGATGCTCTGCTTTGGGATGCCACCGCAGGTAAA 

TTCAGTGCCAAACACGGTACTAATGGTGACGCAAGCGTGATCACTGATGTCGCAGATGGTGAAATTTCAGACTCCAGTTCTGACGCAG 

TAAACGGCTCACAACTCCACGGCGTGAGCAGTTATGTTGTTGATGCGCTGGGGGGTGGTGCCGAAGTCAATGCAGACGGCACCATCAC 

TGCGCCGACGTACACCATTGCTAATGCTGATTACGATAATGTCGGTGATGCCCTGAATGCTATCGATACCACTCTTGACGACGCTCTG 

CTCTGGGATGCGGACGCCGGTGAAAATGGTGCATTTAGCGCCGCTCACGGAAAAGATAAAACTGCCAGTGTAATCACTAACGTCGCTA 

ACGGTGCAATCTCTGCTGCCAGCAGCGACGCGATTAACGGCTCACAACTCTATACCACCAATAAGTACATCGCTGATGCGCTGGGTGG 

TGACGCAGAAGTCAACGCTGACGGCACCATCACCGCACCGACTTACACCATTGCGAACGCCGAGTACAACAACGTCGGTGACGCCCTG 

GATGCGCTTGATGATAACGCCCTGCTGTGGGATGAGACTGCCAATGGCGGTGCTGGAGCCTACAATGCCAGCCATGACGGTAAAGCCA 

GCATCATCACTAATGTCGCTAATGGCAGTATTAGTGAGGACAGTACCGATGCAGTGAACGGTTCTCAGTTGAATGCGACGAATATGAT 

GATTGAGCAGMCACCCAAATTATCAATCAGCTCGCTGGTMCACCGACGCMCCTATATCCAAGAAAACG 

GTGCGTACTAACGACGACGGCTTAGCGTTCAACGACGCCAGCGCACAGGGTGTTGGCGCTACAGCTATAGGTTATAACTCTGTCGCCA 

AAGGCGATAGCAGCGTAGCTATTGGTCAGGGCAGCTACAGCGACGTTGATACGGGTATCGCCCTGGGTAGCAGCTCTGTTTCCAGCCG 

AGTGATTGCCAAAGGCTCCCGTGACACCAGCATAACGGAAAATGGCGTTGTTATTGGTTACGACACCACGGATGGCGAACTGCTCGGT 

GCATTGTCTATCGGTGATGACGGTAAATATCGTCAAATCATCAACGTAGCCGATGGTTCCGAAGCCCATGACGCCGTTACGGTTCGTC 

AATTGCAGAATGCGATTGGTGCGGTCGCAACCACGCCGACTAAATACTTCCACGCTAATTCAACGGAAGAAGATTCACTGGCAGTGGG 

AACTGACTCGCTGGCAATGGGTGCGAAAACCATCGTGAATGGCGATAAAGGTATTGGTATCGGTTATGGTGCCTACGTGGACGCGAAT 

GCACTTAACGGCATTGCCATTGGTAGCAATGCGCAAGTCATTCATGTCAACAGTATTGCGATAGGTAATGGTTCTACGACCACTCGTG 

GCGCTCAAACCMTTATACCGCCTACMCATGGACGCACCGCAGAACTCTGTCGGTGAATTCTCAGTCGGTAGTGCGGATGGTCAACG 

TCAGATCACTAACGTCGCAGCAGGTTCGGCTGATACCGATGCGGTCAACGTGGGTCAGTTGAAAGTAACGGATGCGCAGGTTTCCCAG 

AATACCCAGAGCATTACTAACCTGGATAATCGGGTAACGAATCTTGATTCACGCGTCACCAATATCGAAAACGGTATTGGCGATATCG 

TCACCACCGGTAGCACCAAGTACTTCAAGACCAATACCGATGGTGTAGATGCCAGCGCGCAGGGTAAAGATAGCGTCGCGATTGGTTC 

CGGCTCCATTGCTGCCGCTGACAACAGCGTCGCTCTGGGTACAGGGTCTGTGGCAACCGAAGAAAATACGATCTCTGTAGGTTCCTCT 

ACTAACCAACGTCGTATCACCAACGTAGCTGCAGGTAAAAATGCTACCGATGCTGTTAACGTGGCACAGTTGAAGTCTTCCGAAGCTG 

GCGGTGTACGTTACGACACCAAAGCTGATGGTTCTATCGACTATAGCAATATCACCCTCGGTGGCGGCAACGGCGGTACGACTCGTAT 

CAGCAACGTCTCCGCTGGCGTC^CAACMC 

GATCAGCGAATGGTTGAGATGGATAACAAACTGTCTAAAACTGAAAGCAAGTTGAGCGGTGGTATCGCTTCTGCAATGGCAATGACCG 
GTCTGCCGCAGGCTTACACTCCAGGTGCCAGCATGGCCTCTATTGGTGGCGGTACTTACAACGGTGAATCGGCAGTTGCTTTAGGTGT 
ATCGATGGTGAGCGCCAATGGTCGTTGGGTCTACAAATTACAAGGTAGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGTGCC 

GGTATTCAGTGGTAA 

SEQ ID NO: 28 (EAEC) 

ATGAACAAAATTTTTAAAGTTATCTGGAATCCGGCAACAGGCAGTTACACCGTTGCCAGTGAAACGGCAAAAAGCCGTGGCAAGAAAT 
CCGGGCGCAGTAAGCTGTTAATTTCTGCCCTGGTTGCTGGGGGAATGTTGTCGTCGTTTGGGGTACAGGCTCAGGCAGGAAGGGACAA 
CGGTCAGGGCGTCAATTATGGCCAGGGAACAGGAACAGGCTGGGTTGCTATCGGTGAAGATGCAAAAGCAAATAGTTTTACAGACACC 
GGGGGTGGCTCCAGCACCGCAGTAGGTTATCACTCAACTGCTGACGGAAGGTGGTCAACAGCCCTCGGTGCTAAGACCCATTCGCTTG 
GCGAAGCATCAGTGGCGTTAGGGATTAACACTACCTCAGCAGGTGAGCGAAGCCTCGCTATAGGTGCATCGGCAACGTCAACGGGTGG 
ATTTTCAATCGCACTTGGTCGTTATGCGAACTCAGTAGGTGAATTTTCTATTGCACAAGGGGATCACGCAGAGACTGGTGCTGATGAC 
GCCATCGCATTTGGACGTGAAAGCAAGGCCTTAGGAATTATGAGTATCGCCCTCGGAGCCACGGCTAATGCGTCGAAAGAATATGCAA 
TGGCGCTGGGTGCAAGTAGCGCCGCATCTGCAGCTAACGCGATTGCAGTAGGCCGCAACAGTGCTGCAGCTGGCGTAGACAGCCTCGC 
ATTCGGCAGACAATCCGCCGCCAGCGCCGCAAACGCGATTGCTATGGGTGCAGAGAGCAAAGCTGCTGAAAATGCGACGGCAGTTGGT 
ACTAACGCGGAGGCAAATGGACTTAATAGTATCGCGTTGGGCTCCGGCAGTATTGCAGATGTCGACAACACGATTGCTCTGGGCAATC 
AMGTCAGGCAGTAGCGGCTGGCGCAATTGCCATCGGTCAAGGGAATAAAGCTGACGGCGCAAATGCTATCGCGCTGGGTAATGGTAG 
TATTACAGGTGGTGTAAATGCTATTGCTCTTGGACAAGGCAGTTATGCCGGTTTAGAAAATGGCACTGCAATTGGTGCTCAAGCCAGT 
GCTCAGGGGAAAAATTCAGTTGCTCTGGGTGCTGGTTCTGTAGCGACTGACGCGGATACTGTTTCTGTGGGTAACACAACAGCTGAGC 
GACAAATTGTCAATATGGCAGCAGGTGATATCAGCACTACCAGTACTGATGCCATCAATGGATCACAGCTTTATGCTATCAGTAAGTC 
AGTAGCGGACAATCTTGGTGGGGGGGCTACCGTCAATGCGCAAGGCGTCGTTACTTCCCCAAATTACAGGCTGAAAAGTGGTATTTTC 
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GGCACTGTTGGCGACGCCTTMCGGGCCTGGACMTMTACGTTACMTGGGACTCCCTTAAAAAGGCATATAGTGCGGCACATGGTA 
CAGATACTACCAGTACCATCACCAACGTTAAAGACGGCGCTATTTCTGATACCAGTAAGGATGCGGTTAACGGTTCTCAGCTGAAAAC 
CACCMCGATMCGTAGCGACCMTACTGCCMTATCACC^^ 
AMGACGATGCCCTGCTGTGGMTGGCACCGCGTTCAGCGCCGCGCACGGCM 

GTGACCTGACGGCTGGCAGCACCGACGCGGTTMCGGCTCTCAGCTCAAAACCACTMCGATAATGTGGCAACCMCACCACCAACAT 
CACCAACCTGACGGATTCCGTTGGCGACCTTAAAGACGATGCCCTGCTGTGGAATGGCACCGCGTTCAGCGCCGCGCACGGTACCGAT 
GCCACCAGCAAAATCACCMCGTCAAAGACGGTGACCTGACGGCT^ 

ACGATGCCGTGGCAGCCAACACCACCAATATCGCCACGAATACCACCAACATCACCAACCTGACTGACGCAGTTGACAGCCTCGGTGA 
CGATTCCCTGCTGTGGAACGCTACAGCGGGGGCATTCAGCGCCGCACACGGTACTGATGCCACCAGCAAAATCACCAACGTCACCGCT 
GGCGACCTGACGGCTGGCAGCACCGACGGGGTTAACGGCTCTCAGCTCAAAACCACTAACGATGCCGTGGCAGCCAACACCACCAATA 
TCGCCACGAACACCACCAACATCACCAACCTGACTGACGCTGTTGACAGCCTCGGTGACGATTCCCTGCTGTGGAACGCTACTGCGGG 
GGCATTCAGCGCCGCGCACGGTACTGATGCCACCAGCAA 

GTTMCGGCTCTC^GCTCAAMCCACTMCGATGCCGTGGCAGCCMCACCACCAATATCGCCACGAACACCACCAACATCACCAACC 
TGACTGACGCTGTTGACAGTCTCGGTGACGATTCCCTGCTGTGGAACGCTACGGCGGGTGCATTCAGTGCCAAACACGGCACCAACGG 
TACTGACAGCAAAATCACCAACTTACTGGCAGGCACTGTATCCTCTGATAGCACTGACGCTATTAATGGCTCACAACTTTATGGCTTA 
GCTGATTCATTTACGTCATACCTTGGCGGTGGTGCTGATATCAGCGATGCGGGTGTATTAACCGGGCCAACCTACACTATTGGTGGTA 
CTGATTACAATAACGTCGGTGATGCTCTGGCTGCCATTAACACGTCATTTAGCACATCACTCGGTGACGCCCTACTCTGGGATGCGAC 
CGCCAAGGGCGGTGACGGTGCGTTTAGCGCCGGTCGCGGGACAGATAACACAGCCAGCATCATTACTAATGTTGCAGACGGTGCAATT 
TCCTCTACCAGCAGCGACGCTATTAACGGCTCACAGCTCTATGACACCAGCAAGTACATTGCCGATACTCTGGGTGGTGACGCAGAAG 
TCAATGCTGACGGCACAATCACCGCACCGACTTATGCCATCGCTGGCGGCAGTTACAGCAACGTCGGTGACGCGCTGGAAGCGATCGA 
TACCACGCTGGATGACGCTCTGTTGTGGGATGCAACAGCCAATGATGGCAATGGTGCATTTAGCGCCGCTCACGGAAAAGATAAAACA 
GCCAGTGTMTCACTMCGTCGCTAACGGTGCMTCra 

AGTACATTGCCGATGCCCTGGGTGGTGACGCAGAAGTTAACGCTGATGGTTCTATTACTGCGCCGACTTACACCATTGCAAATGCCGA 
GTACAACAACGTCGGTGACGCCCTGGATGCGCTCGACGATAACGCTCTGCTGTGGGATGCAACAGCCAATGACGGCGCAGGTGCCTAC 
AACGCCAGCCATGACGGCAAGGCCAGCATCATCACAAATGTTGCTGATGGTAACATTGGCGAAGGCAGCACTGACGCCATCAACGGTT 
CTCAGCTGTTTAACACCAATATGCTGATCCAGCAGAACAGCGAAATCATCAATCAGCTTGCCGGTAACACCAGCGAAACCTACATCGA 
AGATAACGGTGCGGGCATTAACTATGTACGTACCAACGACAACGGCTTAGCGTTCAACGATGCCAGCGCTTCAGGTATTGGCGCTACG 
GCTGTGGGTTATAACGCTGTCGCCTCAGGCGAMGCAGCGTAGCCATTGGTCAAGGTAGCAGCAGCAACGTTGATACGGGTATCGCCC 
TGGGTAGCAGTTCCGTTTCCAGCCGTGTAATAGTTAAAGGTTCTCGTGACACCAGCGTGTCGGAAGAAGGTGTTGTGATTGGTTATGA 
CACCACGGATGGCGAACTACTTGGTGCGTTGTCTATTGGTGATGACGGTAAATATCGTCAAATCATCAACGTAGCCGATGGTTCCGAA 
GCCCATGACGCCGTTACGGTTCGCCAGTTGCAAAATGCCATTGGTGCAGTCGCTACCACGCCGACCAAATACTTCCACGCCAACTCAA 
CGGMGMGATTCACTGGCAGTAGGTGMGACTCGCTAGCAATGGGCGCGAAAACTATCGTTAATGGTAATGCGGGTATTGGTATCGG 
TTATGGTGCCTACGTGGACGCGAATGCACTTAATGGCATTGCTATCGGTAGCAACGCGCGTGCAAACCATGCAAACAGCATTGCTATG 
GGTMTGGCTCACAGACGACTCGTGGTGCTCAAACTGGCTACGCCGCCTACAACATGGACGCACCGCAGAACTCTGTGGGTGAGTTCT 
CTGTCGGCAGTGAAGACGGTCAACGTCAGATCACCAACGTCGCGGCTGGTTCGGCTGATACCGATGCGGTTAACGTGGGTCAGTTGAA 
AGTAACGGACGCGCAGGTTTCCCAGAATACCCAGAGCATTACTAACCTGAACAATCAGGTCACTAATCTGGATACTCGCGTTACTAAT 
ATCGAAAACGGTATTGGCGACATTGTAACCACCGGTAGCACCAAGTACTTCAAGACCAACACCGATGGCGTAGATGCCAACGCGCAGG 
GTAAAGATAGTGTTGCGATTGGTTCTGGTTCCATTGCTGCCGCTGACAACAGCGTCGCGCTGGGTACAGGCTCCGTGGCCAACGAAGA 
AAATACCATCTCTGTGGGTTCTTCTACCAACCAGCGTCGTATCACCAACGTTGCTGCAGGTGTTAATGCCACCGATGCGGTTAACGTT 
TCGCAGCTGAAGTCTTCTGAGGCAGGCGGCGTTCGTTACGACACCAAAGCTGATGGCTCTGTAGACTACAGCAACATCACTCTCGGTG 
GTGGTAATGGCGGTACGACTCGCATCAGCMCGTCTCCGCTGGCGTGMCMCMCGACGCAGTGAACTATGCGCAGCTGAAGCAAAG 
CGTGCAGGAAACGAAGCAATATACCGATCAGCGGATGGTTGAGATGGATAACAAACTGTCCAAAACCGAMGCAAGTTGAGCGGTGGT 
ATCGCTTCTGCMTGGCAATGACCGGTCTGCCGCAGGCTTACACTCCAGGTGCCAGCATGGCCTCTATTGGTGGCGGTACTTACAACG 
GTGAATCGGCTGTTGCTTTAGGTGTGTCGATGGTGAGCGCCAATGGTCGTTGGGTCTACAAATTACAAGGTAGTACCAATAGCCAGGG 
TGAATACTCCGCCGCACTCGGTGCCGGTATTCAGTGGTAA 

SEQIDNO: 29 (Burkholderia fungorum) 

ATGAACAAGACATATCGCTCCGTCTGGAACGAGTCGACAGGCACTTGGGTCGCTGCGTCGGAACATGCAAGTGCACGCGGGAAAAAAT 
CTTCTGCGAAAACCTCTTCCACTAAGGCAGTCGTCGGGGCGTTGGGTCTGGCAGCCGGGCTATATGGTGCCGACGCGTTTGCGCTGGG 
TGGAGGGCTAACCCTTTGTCCGACCACCGAAGGATCCGCCGGGTACACTGCCGGTAGCGCTTCTAGCGCAAACGGGGCTTACTGCGGG 
TCCGACTATCAATGGGGTTTGTTCAGCMCACCAATGCCGACGGCAGTAAAAGCGGCCAGCCGATTGGTGCCGCGATTGAGGGAATGA 
ATGACGGCTCATTGCTGTTGTACGGTCCCAACAACATTGTGATGAAGAACCTGGTCAGCATGTCCAGCAACAAGATCATCAATCTTGC 
CCCTGGGACAGTGAGCTCTACCAGCGCAGACGCGGTAAACGGCTCGCAGTTGTATGCGACGAACCAGAACGTGTCGAACATCGGAAAC 
ACGGTGAACAACATCACCACCGGTGCCGGCATCATGTACTTCCACGTGAACTCGACGCTGGCGGATTCGACCGCGAACGGCGTGAATT 
CTATTGCTATCGGTGGCGCGACCAGAACGGACGCGAACAATTCGATTTCGATCGGTACGGGACTGACGCAAGCCTCAAGCAATACAGG 
GGCTATCGCAATTGGTCAGAATGCGAGCATCAATGTATACGGCGCGAATAGTATCGCTATCGGCACAAACAGTGCGACGGGTGGCATT 
GGCGGTGCGATTGCGCTCGGCGAGAACGCCTTTGCAACCGGCGGCAAGATGTTGGCGCTCGGCTCCGGTGCGAGTGCGACGACGGCTA 
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ATTCGGTCGCGCTAGGCTCAGGTTCGACGACGACGGCGMTTTGACAGCAGCAGGATATMCCCTGGCAGCGGCACGCTTGCCGGTAC 
GTCACAGGCTACGAATGGCGAAGTGTCGGTGGGCAATGCAGGTGCAGAGCGTCGTATCACTAACGTTGCGGCCGGTTCGGCAGCCACG 
GATGCGGTGAACGTGAGCCAGTTGCAATCGGAAGATGCAAAGGTGAACACGATCAACAACAACGTGAACAACCTGAGCGGCAGTGTCA 
CCMCATCAGCAGCACGGTGMCAACATCACCAACGGTGGTGGCATCAAGTATTTCCACGCGAACTCGACGCAGGCGGATTCGTCGGC 
CACGGGCACGGATGCAGTGGCGATCGGTGGCAATGCCCAGGCGACGGCGGCGAACTCGGTGGCGCTGGGTTTGAACTCGACGAGTAAG 
GGAACGAACGCGATTGCACTGGGCGGGGCCGTGGCTGGGGGGAGCTACGCTTTCGCGGCAGGCAGCCTGGCATTGGCTGCAACTACTG 
GTGACATTGCGCTGGGTTCATCGGCGACGGOT 

CGCAACCGATGCAACTGCCATTGGCGAAGGTGCCTCGGCAACAGCAGCATCCTCGGTAGCGTTGGGCGCGAGATCCAAAACGACAGCG 

AACCTGTCGACAGCGGGCTATAACCCGGGTACGGGGACGTTGTCCGGTACTACCCCGACGGGCGAAGTGTCGGTGGGCTCGGCAGGCA 

AGGAACGACGCGTGACGAACGTGGCAGCCGGCTCGGCAGCGACGGATGCGGTGAACGTGAGCCAGTTGATGTCGGAAGACGCAAAGGT 

GMCACGATCMCMCAACGTGMCMCCTGAGCAACMCGTCACGMC^TCGCGGGCMTGTCACCAACATCAGCAACACGGTGM 

AACATCACCAATGGTGGTGGCATCAAGTACTTCCACGTGAACTCGACGCTGGCGGATTCTTCCGCGGGGGGAAG^AATTCTATAGCTA 

TCGGCGGCGGGGCTACTACGGGGAATGTTACTGCAGGMCTTC^GAC^ 

AAATATTGCCATCGGCGGCAATGCGCAGGCCTTGGGAGGCGCATACGACGGTGGCTACAACACTGCGATTGGTGAGAACGCGATTGCA 
AAGGGAGATGGAGCAGGTGGCTTCGGCGGAGGGGGGTGGGGGCAGACCACGGCCATTGGCGGTGGTTCGCAGGCATTGCATGACAACA 
CCACGGCGGTCGGTTCAGGCGCGATCGCAAATGTGGCTAATGCTACTGCTTTGGGTATGTCTGCCAGCGCGACCGCTGGGAGTGCTAT 
CGCGCTGGGTCAGGGGGCAGTGGCGTCGGCAGCGAACTCGGTGGCACTGGGTTCGGGTTCGACGACGACGNCGAACCTGTCGGCAGCG 
GGTTATAACCCGGGTACGGGTACATTGTCGGGCATCGCTTCGGTAGCCMTGGCGAAGTGTCGGTGGGTGCAGCAGGCMGGAACGTC 
GCATCACCAACGTGGCAGCCGGCTCGGCAGCCACGGATGCAGTGAACGTGAGCCAGTTGCAATCGGAAGATGCAAAGGTGAACACGAT 
CMCMCMCGTGMCAACCTGAGCGGCAGTGTCACCAACATCAGCAACACGGTGAACM 

TTCCACACGAAATCGACGCTGGCCGATTCGTCGGCGACGGGTACGGATGCAGTGGCGATCGGCGGCAATGCCCAGGCGACGGCGGCGA 
ACTCGGTAGCACTGGGTTCGMCTCGACGACGACAGCGAACCTGTCGGCAGCGGGTTATAACCCGGGCACGGGTGCGTTGTCTGGCAT 
CGCTTCGGCAGCCAACGGTGAAGTGTCGGTGGGTGCAGCAGGCAAGGAACGCCGCATCACTAACGTAGCAGCCGGCTCGGCAGCCACG 
GATGCGGTGAACGTCAGCCAGCTCCAGTCGGAAGACGCAAAGGTGAACACGATCAGCAACAACGTGAACAACCTGAGCGGCAGCGTCA 
CCMCATCAGCAGCACGGTGAACAACATCACCAATGGTGGTGGCATCAAGTACTTCCACACGMCTCGACGCTGGCGGATTCGACCGC 
GAACGGCGTGAATTCTATTGCTATCGGTGGCGCGACCAGAACGGACGCGAACAATTCGATTTCGATCGGTACGGGACTGACGCAAGCC 
TCAAGCMTACAGGGGCTATCGCAATTGGTCMMTGCGA 

CGACGGGTGGCATTGGCGGTGCGATTGCGCTCGGCGAGAACGCCTTTGCAACCGGCGGCAAGATGTTGGCGCTCGGCTCCGGTGCGAG 
TGCGACGACGGCTAATTCGGTCGCGCTAGGCTCAGGTTCGACGACGACGGCGAATTTGACAGCAGCAGGATACAACCCGGGCAGCGGC 
ACGCTTGCCGGTACGTCACAGGCTACGAATGGCGAAGTGTCGGTGGGCAATGCAGGTGCAGAGCGTCGTATCACTAACGTTGCGGCGG 
GCTCGGCAGCCACGGACGCGGTGAACGTGAGCCAGTTGCAGTCGGAAGATGCGAAGGTGAACACGATCAACAACAACGTGAACAACCT 
GAGCAACAACGTCACAAACATCGCGGGTAACGTCACCAACATCAGCAACACGGTGAACAACATCACCAATGGTGGTGGCATCAAGTAT 
TTCCACACGAMTCGACGCTGGCCGATTCGTCGGCGACGGGCACGGATGCAGTGGCGATCGGCGGCAATGCCCAGGCGACGGCGGCGA 
ACTCGGTAGCACTGGGTTCGAACTCGACGACGACAGCGAACCTGTCGGCAGCGGGCTACAACCCGGGCACGGGTACGCTGTCCGGTAC 
TACCCCGACGGGCGAAGTGTCGGTGGGCTCGGCAGGCMGGAACGTCGCGTGACGMCGTGGCAGCCGGCTCGGCGGCGACGGATGCG 
GTGAACGTGAGCCAGTTGCAGTCGGCCATCATCGGCAGCACCGCGAACGCGGTCGCCTATGACGACGGCACGAAGGCCACGGTTACGC 
TGAAGGGCGCGAGCGGCACGAAGATCACCAACCTGACGGCGGGCAATCTGAGCGCAACGAGCACGGACGCGGTGAACGGCTCGCAGTT 
GTACGCGACGAACCAGAACGTGTCGAACATCGGTAACACGGTGAACAACATCACCAACGGTGGTGGCATCAAGTATTTCCACGCGAAC 
TCGACGCAGGCGGATTCGTCGGCCACGGGGTCGAACTCGGTCGCGGTGGGCGACAGGGCGTCGTCGCTTGGCGGTTCGTCGGTCGCGA 
TGGGTGACGGCGCGACAGCAGTGGGCGCGGCGAGTATTGCGATTGGTAACAATGCGCAGAACGTGACCGGGTCGAACAATTCAGTTGC 
CATTGGAGGTGATTCGAAGGCGGGCGACAGGTCCGTTTCACTGGGCAACGGTGCAGATACCTCGCTGTCGAGCTGGGGCGTTGCAGTC 
GGTACCAACGCGAACGTTTCTGCTGCACTGGGCACGGCAATTGGTGCCGGGGCCAACGTGAGCGGCGCGAATTCGACGGCCATCGGCG 
CGAACGCGGTGGCCTCTGCGACCAACTCGGTGGCGCTGGGTTCGAATTCGACGACGACGGCGAACCTGTCGGCTGCGGGTTATAACCC 
GGGTACGGGTACGTTGTCGGGCATCGCGTCGGCGGCCAATGGCGAAGTGTCGGTCGGTGCAGCAGGCAAGGAACGTCGCGTGACGAAC 
GTGGCAGCCGGCTCGGCAGCCACGGATGCGGTGAACGTGAGCCAGTTGCAGTCGGAAGATGCGAAGGTGAACACGATCAACAACAACG 
TGMCMCCTGAGCGGCAGTGTCACCMCATCAGCAGCACGGTGAACAACATCACCAACGGTAGCGGCATCAAGTACTTCCACACGAA 
CTCGACGTTGGCGGATTCGTCGGCTGGGGGGGCCAATAGCATTGCTATCGGTGGCGGGGCGGCGACCAGTAGCAGTGCCGGCCTGAGT 
GACAACATGGCGATCGGCACCAACGCCACCGCAAGCTACGGCAAGAACATCGCCATCGGCGGCGGTGCGCAGGCTACAGGTGGCACGT 
ACGACGGCGGCTATAACGTCGCACTCGGTGAAAACGCTAATGCCACAGCAGGCACGAACGCCTGGGGACACAATACGGCGATCGGTGC 
CAACACCGTAATCAATGGCGTCAACTCGGTAGCGCTCGGGATCTCGGCGACGACGAGCGGGAGCGGCAGCATGGCCTTTGGCTCGGCG 
GCACAGGCAAGCGCGGACTATGCGATTGCAAGTGGGGCAGGCGCCAATGCATCGGCTGTGAATTCCGTGGCGCTGGGTTCGAACTCGA 
CGACGACGGCCAACCTGTCGGCAGCGGGTTATAACCCGGGTACGGGTACGCTGTCGGGCATCGCTTCGGTAGCCAATGGCGAAGTGTC 
GGTGGGCTCGGCAGGCMGGMCGTCGCGTGACCMCGTGGCAGCCGGTTCGGCAGCGACGGATGCGGTGAACGTCAGCCAGCTCCAG 
TCGGAAGACGCCAAGGTGAACACGATCAACAATAACGTGAACAACCTGAGCAACAACGTCAGCAACATCGCGGGCAACGTCACCAACA 
TCAGCMCACGGTGMCMCATCACCAACGGTGGCGGCGGCATCAAGTACTTCCACGCGAACTCGACACTCGCCGATTCGTCGGCAAC 
GGGCACGGATGCAGTGGCGATCGGCGGGAATGCCCAGGCGACGGCGGCGAACTCGGTAGCACTGGGTTCAAACTCGACGACGACGGCA 
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MCCTGTCGGCAGCGGGCTACMCCCGGGCACGGGTACGCTGTCCGGTACGACCCCGGTGGGCGAAGTGTCGGTGGGCTCGGCAGGCA 
AGGAACGTCGCGTGACAAACGTGGCGGCCGGCTCGGCGGCCACGGATGCGGTGAACGTGAGCCAGTTGCAGTCGGCCATCATCGGCAG 
CACCGCGAATGCGGTCGCCTATGACGACGGCACGAAGGCCACGGTTACGCTGAAGGGCGCGAGCGGTACGAAGATCACCAACCTGACG 
GCAGGTAATCTGAGCGCAACGAGCACGGACGCGGTGAACGGCTCGCAGTTGTATGCGACGMCCAGAACGTGTCGAATGTCGGTAACA 
CGGTCAGTAACCTGAGCMCMCGTCACGMCATCGCGGGTMCGTCACCMCATCAGCAACACGGTGMCAACATCACCAATGGTGG 
TGGCATCAAGTATTTCCACGCGAACTCGACGCTCGCCGATTCGTCGGCGACGGGCACGGATGCAGTGGCGATCGGTGGCAATGCCCAG 

gcgacggcagcgaactcggtggcgctgggttcaaactcgacgacgacggcgaacctgtcggcagcgggctataaccctggcacgggtg 

cgttgtccgctacgaccccggtgggcgaagtgtcggtgggctcggcaggcaaggaacgtcgcgtgacgaacgtggcagccggctcggc 

agcgacggatgcagtgaacgtcagccagttgatgtccgaagatgccaaggtgaacacgatcaacaacaacgtgaacaacctgagcaac 

mcgtcagcaa^tcgcgggtaacgtcacc^ 

acgcgmctcgacgctggcggattcgtcggcmcgggcgttgac^ 

ggtagcactgggttcgaactcgacgacgacagcgaacctgtcggcagcgggttataacccgggcacgggtgcgttgtctggcatcgct 
tcggcagccaacggtgaagtgtcggtgggtgcagcaggcaaggmcgccgcatcactaacgtagcagccggctcggcagccacggatg 
cggtgaacgtcagccagctccagtcggaagacgcgaaggtgaacacgatcaacaacaacgtgaacaacctgagcaacaacgtcagcaa 
catcgcgggcaacgtcaccaacatcagcaatacggtgaacaacatcaccaacggtggcagcggcatcaagtacttccacgcgaactcg 
acactcgccgattcgtcggcmcgggcacggatgcagtggcgatcggtgggaatgcatcggcatcggcggcaaactcggtggcgctgg 
gttcgmctcgacgacgacggcgaacctgtcggcagcgggatacaacccgggttcggcagcactgtcgggcacggcctcggcagc^ 
cggcgaagtgtcggtcggtgcagcaggcaaggaacgccgcatcacgaacgtagcagccggctcggcagccacggatgcggtgaacgtg 
agccagctccagtcggaagacgcgaaggtgaacgctgaaggcgcggccactgcggcagcgctgggcggcggttcgacctacaacacga 
cgacgggtgcgatcaccagtccgacgtacatcgcaggcggcaagacgttcaacaatgttggcgatgtagtcacgaacatcgacggccg 
tgttacgcagaactcgacggacatcacgaacctgactacgaccatcgacaacggcacgatcggtctggtgcagcaggctacgccgacg 
agcacgattacggtcgcgaaggacacgggcggcgcgacggtggatttccggggcacgggcaatgcaactcgcacgttgacgggcatca 
cggcgggtgagttgagcgcaacgagcacggatgcggtgaacggctcgcagttgtacgcgacgaaccagaacgtgtcgaacattgacaa 
cacggtcagtaacctgagcaacaacgtcacgaatatcgcgggcaatgtcaccaacatcagcaacacggtgaataacatcaccaacggt 
ggtggcggcatcaagtacttccacgcgaactcgacgctggcggattcgtcggcaacgggcgttgacgcagtggcgatcggcggcaatg 
cccaggcgacggcagcgmctcggtagcactgggttcgmct^ 

gggcacgttgtctggcatcgcttcggcagccaacggtgaagtgtcggtcggtgcagcaggcaaggagcgccgcgtgacgaacgtagca 
gccggctcggcagccacggacgcggtgaacgtgagccagttgcaatcggaagatgccaaggtgaacacgatcaacaacaacgtgaaca 
acctgagcaacaacgtcagcaacatcgcgggcaatgtcacgaacatcagcaacacggtgaacaacatcaccaacggtggcggcggcat 
caagtacttccacgcgaactcgacgctggcggattcgtcggcgacgggcaccaacagcctggcggccggaccggcagcggtggcatcc 
gccactgacgcagtcgcactgggcaacggcgcgaaggcaaccaacgccggcgcggtggcactcggcgcaggttcgacgacaacgaccg 
cggtggcaacctccggcacgacgattggcggcatcacctacacgttcgcaggcgtcgctccgtcgagcacggtgagcgtgggtgcggc 
gggtagcgagcgcacgatcacgaacgtggcggctggccgcctgagcgcgacgagcacggacgcggtcaacggcagcgaactgtttgca 
accaaccagcaggtgacgcgaaacaccgcggacatcaccaacctgacgaacaacatgaacatcggttcggtgggtctggtgcagcagg 
acgcgacgacacgcaccatcacggtcgcgaaggccaccgacggtacgcgggtcgacttcacgggcacggggggcgcgcgtcaattgac 
gggcgtggccgcaggcgcagtgaacgcgacgagcgtggatgcggtgaacggttcgcagctctacggtgtgtcgcagagcgtggcggat 
gcgatcggcggtggctcgaccgtgaacacggatggctccatctcggccccgacctacgttgtggacggcacgaccgtccacaatgcgg 
gcgacgcgatcagcmcctcgacaaccgtgtgacgcagaacaccaccgacatcagcacgatcaacaacacgctgaacagcatcaccac 
gggtgcgggcgtcmgtacgtgcatgtgmctcgacot 

gcgcaatcgcaggcggcgaactcggtggcattgggttcgaactcggtggcggatcgtgccaacacggtgtcggtgggcgcggctggtg 
cggagcgtcagatcaccaacgtggcggccggtacggcggacacg(^tgcggtgaacgtcgcgcagttgaaggcgtcgggtgtgat(^ 
tacggatggcacgaccaacgccgccgtcacgtacgaccacaacgcggacggctcggccaactacaacagcgtcacgatgggtaacggt 
gtagcgggcggcacgacgatccacaacgtggcggcaggctcggctgccgacgacgcggtcmcgtcagccagatgaacgcggcgatct 
ccagcgtctcgaacatcatcggctcggcgggtaatccgctgttcacagccgacggcaaccgtgacacggaagccgcggtcgccagcgg 

CACGCATGCCACGGCAATGGGCGCGAACGCGAAGGCAAGCGCGGCCAATTCGGTGGCGCTGGGTGCGAACTCGGTGGCGGACCGTGAG 
AATACGGTGTCGGTGGGCTCGGCGGGCAACGAACGCCAGGTCACCAATGTCGCGGCCGGTACGGCCACCACCGACGCCGTCAACGTCG 
GCCAGTTGAACCAGGCGATCGGCGCATCGATCGGCAATCTGCCGGCCGGCATGTCGGCGAAGGACTACACCGACCAGCAGATCAACGC 
GGTGCAGAACGGCGTGMCCAGGTTGCGAAGAACGCGTATGCAGGTATCGCGGCCGCTACGGCGCTGACGATGATTCCGGACGTCGAT 
CAGGGCAAGACGATCGCGGTCGGTGTCGGCGGCGGCTCGTATAAGGGTTCGCAGGCTGTGGCGCTCGGCATCTCGGCACGTATCACCC 
AGAACCTGAAGATGAAGGCCGGCGCGGGTACGAGCTCGCAAGGCACGACGGTGGGCCTGGGTGCTTCCTACCAGTGGTAA 

SEQ ID NO: 30 (EPEC) 

ATGCTGATCCAGCAGAACAGCGAAGTCATCAATCAGCTTGCCGGTAACACCAGTGAAACCTACATCGAAGAAAATGGTGCAAGTATTA 
ACTATGTGCGTACCAATGACACCGGTTTAACCTTCACCGATGCCAGCGCAGCAGGTATTGGCTCTACCGCTGTGGGGTATAACACTGT 
TGCCAAAGGCGATAACAGCGTGGCCATGGGTTATAACTCTTTTGCCGAAGGCCATAGTAGCGTGGCCATCGGTCAGGGCAGCTACAGC 
GGCGTTGAGACGAGTATTGCGCTGGGTAGCGAATCCGTCTCCAGCCGCGTGATTGTTAAAGGTTCTCGTAACACGAGCGTATCGGAAG 
AAGGTGTTGTGATTGGTTATGACACCACGGATGGCGAACTGCTTGGCGCATTGTCGATCGGTGATGATGGTAAATATCGTCAAATCAT 
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CMCGTCGCGGATGGTTCTGMGCCCATGACGCTGTTACTGTTCGCCAGTTGCAAAACGCCATTGGTGCAGTAGCAACCACACCAACC 
AAATACTATCACGCTAACTCAACGGCTGAAGACTCACTGGCAGTCGGTGAAGACTCGCTGGCAATGGGCGCGAAAACCATCGTTAATG 
GTAATGCGGGTATTGGTATCGGCCTGAACACTTTAGTTCTGGCTGATGCGATCAACGGTATTGCTATCGGTTCTAACGCACGCGCAZVA 
TCATGCCGACAGCATTGCTATGGGTAATGGTTCTCAGACTACTCGTGGTGCGCAGACCAACTACACTGCCTACAACATGGATGCACCG 
CAGAACTCTGTGGGTGAGTTCTCTGTCGGCAGTGAAGACGGTCAACGTCAGATCACCMCGTCGCAGCGGGGTCGGCGGATACCGATG 
CGGTTAACGTGGGTCAGTTGAAAGTAACGGACGCGCAGGTTTCCCAGAATACCCAGAGCATTACTAACCTGAACACTCAGGTCACTAA 
TCTGGATACTCGCGTGACCMTATAGAAAACGGCATTGGCGATATCGTMCCACCGGTAGCACCAAATACTTCMGACCMCACCGAT 
GGCGTAGATGCCAACGCGCAGGGTAAAGACAGTGTTGCGATTGGTTCTGGTTCCATTGCTGCCGCTGACAACAGCGTCGCGCTGGGTA 
CAGGCTCCGTGGCCAACGAAGAAAATACCATCTCTGTGGGTTCTTCTACCAACCAGCGCCGTATCACCAACGTTGCTGCCGGTGTTM 
TGCCACCGATGCGGTTAACGTTTCGCAACTGAAGTCTTCTGAAGCAGGCGGCGTTCGCTACGACACCAAAGCTGATGGCTCTATCGAC 
TACAGCAACATCACTCTCGGTGGTGGTAACGGCGGTACGACTCGCATCAGCAACGTTTCTGCTGGCGTGAATAACAACGACGCGGTGA 
ACTACGCGCAGTTGMGCAMGTGTGCAGGAMCGMGCAATACACCGATCAGCGCATGGTTGAGATGGATAACAAACTGTCCAAAAC 
TGAAAGCAAGCTGAGTGGTGGTATCGCTTCTGCAATGGCGATGACCGGTCTGCCGCAGGCTTACACGCCGGGTGCCAGCATGGCGTCT 
ATTGGTGGCGGTACTTACAACGGTGAATCGGCTGTTGCTTTAGGTGTGTCGATGGTGAGCGCCAATGGTCGTTGGGTCTACAAATTAC 
AAGGTAGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGTGCCGGTATTCAGTGGTAA 

SEQ ID NO: 31 (Shigella) 

ATGAACAAAATATTTAAAGTTATCTGGAACCCTGCGACAGGGAATTATACTGTTACCAGCGAAACGGCAAAAAGCCGTGGCAAGAAAT 

CTGGGCGCAGTAAGCTGTTAATTTCTGCGCTGGTTGCGGGTGGAATGTTGTCGTCGTTTGGGGCATTGGCGAATGCCGGGAATGACAA 

CGGTCAGGGTGTTGATTACGGTAGTGGATCAGCTGGCGACGGCTGGGTTGCTATAGGCAAAGGGGCGAAAGCAAATACTTTTATGAAC 

ACCAGTGGTTCCAGTACTGCTGTGGGTTATGACGCTATAGCTGMGGCCMTATAGCTCTGCCATCGGGTCAAAAACCCATGCGATTG 

GTGGTGCATCAATGGCCTTTGGGGTTAGTGCAATATCAGAAGGCGATAGAAGTATAGCACTGGGTGCCTCTTCGTATTCATTGGGCCA 

ATACTCAATGGCCCTCGGCCGTTATTCAAAAGCATTGGGTAAATTGTCTATTGCTATGGGGGACTCTTCCAAAGCGGAAGGAGCAAAC 

GCCATTGCCCTGGGGCGTAGCAGTGTAGCTAGCGGTACTGACAGCCTCGCATTTGGCAGACAATCACTTGCCAGCGCAGCGAACGCTA 

TTGCGATAGGTGCTGAGACCGAAGCCGCTGAAAATGCAACTGCTATTGGCAATMTGCGAAGGCAAAAGGGACTAATAGCATGGCMT 

GGGGTTCGGMGCCTTGCCGATAMGTCMTACTATCGCATTAGGAMTGGCAGCCAGGCTCTGGCAGATAATGCAATCGCCATAGGC 

CAGGGCAACAAAGCTGATGGCGTGGATGCCATCGCTCTGGGTAATGGTAGCCAGTCGAGAGGCTTAAACACCATTGCCTTAGGCACAG 

CCAGTAATGCAACTGGTGATAAGAGTCTTGCGCTTGGTAGTAATAGCAGTGCCAACGGTATTAACTCTGTCGCGCTGGGCGCAGATTC 

CATTGCGGATTTAGACAATACCGTCTCTGTCGGCAATAGTTCATTAAAACGCAAGATCGTTAATGTGAAAZVATGGCGCGATCAAGTCT 

GACAGTTACGATGCCATTAATGGTTCACAGCTTTATGCCATTAGCGACTCGGTAGCAAAAAAGGCTTGGAGGAGGGGCTGCAGTAGAT 

GTTGATGACGGTACTGTTACAGCACCAACCTACAATTTAAAAAATGGTAGCAAAAATAACGTAGGGGCTGCGCTCGCTGTACTTGATG 

AAAACACCCTGCAATGGGACCAAACCAAAGGCAAATACAGCGCTGCTCATGGTACTAGTAGCCCAACTGCCAGCGTAATCACCGATGT 

TGCGGATGGCACGATTTCAGCCTCCAGTAAGGATGCGGTTAACGGTTCCCAACTGAAAGCTACCAATGACGATGTCGAAGCCAACACC 

ACCAATATCACCAACCTGACGGATTCCGTTGGTGACCTTCAGGCTGATGCCCTGCTCTGGAACGAAACTAAAAAGGCATTCAGTGCAG 

CTCACGGCCAGGATACCACCAGCAAAATCACCAACGTTAAAGATGCCGACCTGACGGCTGACAGCACTGATGCTGTTAACGGCTCTCA 

GCTGAAMCCACCMCGATGCTGTGGCGACGMTACCACCMTATCGCCAATAACACTTCCAATATTGCCACTAACACCACCAACATC 

TCTAACCTGACTGAGACGGTGACTAATCTTGGTGAGGATGCGCTGAAATGGGATAAGGACAATGGTGTATTCACGGCAGCTCATGGCA 

CCGAGACCACCAGCAAAATAACCAACGTTAAAGATGGCGACCTGACGACTGGCAGCACCGATGCCGTTAACGGCTCTCAGCTGAAMC 

CACCMCGATGCCGTGGCGACGMTACCACCMTATCGCCACTMCACCACCAAC^TCTCTMTCTGACTGAGACGGTGACTMT 

GGTGAGGATGCGCTGAMTGGGATAAGGACAATGGTGTCTTCACTGCAGCTCATGGCMCMTACCGCCAGCAAAATCACCAATATCC 

TGGACGGCACAGTCACTGCAACCAGTTCCGATGCCATTAACGGTAGCCAGCTTTATGACTTAAGCAGCAATATCGCCACCTACTTCGG 

CGGCAATGCTTCTGTGAATACTGACGGTGTGTTTACCGGTCCAACCTACAAAATCGGTGAAACAAATTATTATAACGTCGGCGATGCA 

CTGGCTGCGATTAACTCCTCATTTAGCACGTCTCTCGGCGATGCTCTGCTTTGGGATGCCACCGCAGGTAAATTCAGTGCCAAACACG 

GTACTAATGGTGACGCAAGCGTGATCACTGATGTCGCAGATGGTGAAATTTCAGACTCCAGTTCTGACGCAGTAAACGGCTCACAACT 

CCACGGCGTGAGCAGTTATGTTGTTGATGCGCTGGGGGGTGGTGCCGAAGTCAATGCAGACGGCACCATCACTGCGCCGACGTACACC 

ATTGCTAATGCTGATTACGATAATGTCGGTGATGCCCTGAATGCTATCGATACCACTCCTGACGACGCTCTGCTCTGGGATGCGGACG 

CCGGTGAAAATGGTGCATTTAGCGCCGCTCACGGAAAAGATAAAACTGCCAGTGTAATCACTAACGTCGCTAACGGTGCAATCTCTGC 

TGCCAGCAGCGACGCGATTMCGGCTCACAACTCTATACCACCAATAAGTACATCGCTGATGCGCTGGGTGGTGACGCAGAAGTCMC 

GCTGACGGCACCATCACCGCACCGACTTACACCATTGCGAACGCCGAGTACAACAACGTCGGTGACGCCCTGGATGCGCTTGATGATA 

ACGCCCTGCTGTGGGATMGACTGCCAATGGCGGTGCTGGAGCCTACAATGCCAGCCATGACGGTAAAGCCAGCATCATCACTAATGT 

CGCTAATGGCAGTATTAGTGAGGACAGTACCGATGCAGTGAACGGTTCTCAGTTGAATGCGACGAATATGATGATTGAGCAGAACACC 

CAAATTATCAATCAGCTCGCTGGTAACACCGACGCAACCTATATCGAAGAAAATGGCGCTGGTATCAACTACGTTCGTACTAACGACA 

ACGATTTAGCCTTTAACGATGCAAGCGCCTCTGGTGTCGGCGCTACAGCTGTAGGTTATAACGCTGTCGCGTCTGGTGCCAGCAGCGT 

AGCCATTGGTCAGAACAGCAGCAGCACCGTTGATACCGGTATTGCGCTGGGTAGCAGCTCCGTTTCCAGCCGTGTGATTGCCAAAGGT 

TCTCGTGACACTAGCGTAACGGAAAATGGCGTGGTGATTGGTTATGACACCACTGACGGCGAACTGCTAGGTGCATTGTCAATTGGTG 

ATGACGGTAAATACCGCCAAATCATCAACGTAGCTGATGGTTCAGAAGCCCATGACGCCGTTACGGTTCGCCAGTTGCAGAACGCTAT 

TGGAGCGGTCGCCACTACGCCAACCAAGTACTTTCACGCCAACTCAACGGCAGAAGACTCACTGGCCGTTGGTGAAGACTCGCTGGCA 
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ATGGGTGCGAAAACTGTCGTTMTGGTAATGCAGGTATTGGTATCGGTTTGAACACTCTGGTTCTGGCTGATGCGATCAACGGCATTG 
CTATCGGCAGTAACGCACGCGCAAACCATGCAAACAGTATCGCAATGGGTAACGGTTCTCAGACCACCCGTGGTGCACAGACTGGCTA 
CACCGCCTACAACATGGACGCACCGCAGAACTCTGTAGGTGAGTTTTCTGTCGGTAGTGAAGACGGTCAACGTCAGATCACAAACGTC 
GCAGCTGGTTCAGCGGATACCGATGCGGTTAACGTGGGTCAGTTGAAAGTCACTGATGAGCGCGTAGCGCAAAATACCCAGAGCATTA 
CTAACCTGAACAATCAGGTCACTAATCTGGATACTCGCGTTACTAATATCGAAAACGGTATTGGCGACATTGTCACCACCGGTAGCAC 
CAAGTACTTCAAGACCAACACCGATGGCGTAGATGCCAACGCCCAGGGTAAAGATAGCGTTGCTATTGGTTCTGGTTCCATTGCTGCC 
GCTGACAACAGCGTCGCACTGGGTACCGGTTCCGTTGCAGAGGAAGAAAATACAATCTCTGTAGGTTCTTCTACTAACCAACGCCGGA 
TCACCAACGTAGCTGCCAGTGTTAATGCCACCGATGCGGTTAACGTGTCGCAGCTGAAATCTTCTGAAGCAGGCGGAGTGCGCTACGA 
CACCAAAGCTGATGGTTCTATCGACTATAGCAATATCACCCTCGGTGGTGGCAACGGCAGTACGACTCGTATCAGCAACGTCTCCGCT 
GGCGTCMCMCMCGACGCGGTGAACTACGCGCAGTTGAAGCAM 

AGATGGATMCAAACTGTCTAAMCTGAMGCMGTTGAGCGGTGGTATCGCTTCTGCMTGGCAATGACCGGTCTGCCGCAGGCT 

TACACCGGGTGCCAGCATGGCTTCTATTGGTGGCGGTACTTACAACGGTGAATCGGCAGTTGCTTTAGGTGTATCGATGGTGAGCGCC 

AATGGTCGTTGGGTCTACAAATTACAAGGTAGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGTGCCGGTATTCAGTGGTAA 

SEQ W NO: 32 (Brucella melitensis) 

ATGTCTTTTTTTAAAAAAAATATATCGATAACAGCCATGGGTGGCTTGATGTTGTCGCTTGCAGTCGACGCTGCGAAGGCGGAGGAAA 
ATGTTTCGCAGGTGAAACTTCCTCCTGTTTTTGTTTTTGAGCTTGTTGAAAACCAGGGGCTGGCCAATATTGCTCTGATTCGACCTCG 
AGTGATAGCGCCAGACAACAACCTAGGTCCTGGGGGCATTGTGTCTGGGATCGCTGGTCTTTTGACTCTCGGGCAGGAGAACCGCAAC 
CTAATTAGTGAAAATAGGCAAGTTATTAATAACAATACGACTGCCATTGGCCAGAATCGTACCTCGATTAGTACCAATGCCAAGGGCG 
TTGCCGACAATAGGGCTGCTATCAGACAGAATAGTGCTGCGATTTCCGCTCTTGGCCAGCGAGTTGACGGGTTGCAAGGTCAGATCAA 
CAGCGCCCGGAAAGAGGCACGTGCCGGAGCTGCAAATGCCGCTGCTCTTTCGGGGCTAAGATATGATAACCGTCCAGGCAAGGTGTCA 
ATCGCTACCGGTGTTGGCGGCTTTAAAGGTAGCACCGCTTTGGCTGCTGGTATCGGTTACACAAGCAAGAACGAAAACGCTCGTTATA 
ACGTGAGTGTAGCTTACAACGAAGCTGGCACATCCTGGAATGCAGGTGCAAGCTTTACGCTTAACTAA 

SEQ ID NO: 33 (Brucella suis) 

ATGTCTTTTTTTAAAAAAAATATATCGATAACAGCCATGGGTGGCTTGATGTTGTCGCTTGCAGTCGACGCTGCGAAGGCGGAGGAAA 
ATGTTTCGCAGGTGAAACTTCCTCCTGTTTTTGTTTTTGAGCTTGTTGAAAACCAGGGGCTGGCCAATATTGCTCTGATTCGACCTCG 
AGTGATAGCGCCAGACAACAACCTACGTCCTGGGGGCATCGTGTCTGGGATCGCTGGTCTTCTGACTCTCGGGCAGGAGAACCGCAAC 
CTAATTAGTGAAAATAGGCAAGTTATTAATAACAATACGACTGCCATTGGCCAGAACAGCGATCGGATCGACGCCAATGCCAAAGGTG 
TTGCCGACAATAGGGCTGCCATTGGCCAGAACAGCGGCCGGATCGACGCCAATGCCAAGGGCGTTGCCGACAACAAGGCTGCCATTGG 
CCGAMCAGCGGTCGGATCGACGCCMTGCCMGGGCGTTGCCGACMCMGACTGCCATTGGTCGAAACAGTGGTCGGATCGATACC 
MTGCCAMGGTGTTGCCGACMTAGGGCTGCCATCAGCC/yy\ACCGCGGTCGCATCMTGCTAACGCAGCAGGTGTTGCTAGCAATA 
GGGCTGCTATCAGAC7\GMTAGTGCTGCGATTTCCGCTCTTGGCCAGCGAGTTGACGGGTTGCMGGTC^GATCMCAGCGCCCGGAA 
AGAGGCACGTGCCGGAGCTGCAAATGCCGCTGCTCTTTCGGGGCTAAGATATGATAACCGTCCAGGCAAGGTGTCAATCGCTACCGGT 
GTTGGCGGCTTTAAAGGTAGCACCGCTTTGGCTGCTGGTATCGGTTACACAAGCAAGAACGAAAACGCTCGTTATAACGTGAGTGTAG 
CTTACAACGAAGCTGGCACATCCTGGAATGCAGGTGCAAGCTTTACGCTTAACTAA 

SEQ ID NO: 34 (Ralstonia solanacearum) 

GTGGTTTTCAGCGCCATGCCGCMTACGCTTGCGCAGAMTGTTGCTGCAMACGATCCGGGAACGAATTGTGGAAGCGTGGGTGATG 
CATATGCCTGGGCGCGAGGCGATGGGTATTCGGGTTGTAAGGTCGGTTACGAAGCCGCAAAAAATTTGGCAAAGGGCACAGCATTCGG 
CAATAGCCTGGGTCAACTATCGCCTGGTACGAACATTCTCGTCTACGGGTCGACGCTCCGTGCTGGTATGAATGACGAAGTGACACCC 
CTCGATTCGATGAATATCGGTGGCCATCTCGACGTATGGGGAGCCAGCGGCTTTCACGGCGGTGTCGATATGAATAATTCCGCCATCA 
AAAACCTTGCTGACGGCACGCTGTCGGCCACCAGCACCGAGGCGGTGACTGGCAGGCAACTGAACGCCACCAACACGAACATCACCAA 
TCTGCAGAACAGCATCAAGAGCATCAGCAGTTCGGCGAGCCTGGTGCAGCAGTCGGCCGCGGGCAAGGACATCACAGTGGCCAAGGAC 
CTGGACGGTGACGCGGTGGACTTCAGCGGCAAGAAGCTGAGTGACAGCACGACGTTTTCGCGCAAGCTGACGGGTGTGGCGGAGGGGA 
CGTTGTCGGCGACGAGCACGGATGCGGTGAGCGGCAAGCAACTCTATACGACCAACCAGAACCTGAGCACGACCAACCAGAATCTGGC 
GGACACGAACAAGAGCCTGGCCGAGACGAACAAGAACGTGTCGGCGACCACGACCAACATCACGAACCTGCAGAACACCATCAAGAAC 
ATCAGCGGCGGCTCGGCGGGTCTGGTGCAGCAATCGGCCGCGGGCAAGGACATCACGGTGGCCAAGGACCTGGACGGTGAGGCGGTGG 
ACTTCAGCGGCAAGAAGCTGAGCGACAGCACGACGTTCTCGCGCAAGCTGACGGGTGTGGCGGAGGGGACGCTGTCGGCGACGAGCAC 
GGATGCGGTGAGCGGCMGCAGCTCTATACGACGMCCAGMTCTGGCGAGTACCMCMGGACCTGGCCAATACCAACACGCGCCTG 
ACGACGGCCGAGGGCAATCTGTCGTCGAACACGACGAGCATCACGAACCTGCAGAACACCATCAAGAACATCAGCGGCGGCTCGGCGG 
GTCTGGTGCAGCAATCGGCTGCGGGCAAGGACATCACCGTGGCCAAGGACCTGGACGGTGACGCGGTGGACTTCAGCGGCAAGAAGCT 
GAGCGACAGCACGACGTTCTCGCGCAAGCTGACGGGTGTTGCGGAGGGGACGTTGTCGGCGACGAGTACCGATGCGGTGAGCGGCAGG 
CAGCTCTATACGACCAACCAGAACCTGAGCACGACGAACCAGAATCTGGCGGACACGAACAAGAGCCTGGCCGAGACGAACAAGAACG 
TGTCGGCGACCACGACCAACATCACGAACCTGCAGAACACGGTGAACAACATCAGCAGCGGTTCGGCGGGTCTGGTGCAGCAATCGGC 
AGCGGGCAAGGACATCACAGTGGCCAAGGACCTGGACGGTGACGCGGTGGACTTCAGCGGCAAGAAGCTGAGTGACAGCACGACGTTC 
TCGCGCAAGCTGACGGGTGTGGCGGAGGGGACGCTGTCGGCGACGAGCACGGATGCGGTGAGCGGCAAGCAGCTCTATACGACCAACC 
AGAACCTGAGCACGACCAACCAGAATCTGGCGGACACGAACAAGAGCCTGGCCGAGACGAACAAGAACGTGTCGGCGACCACGACCAA 
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CATCACGMCCTGCAGMCACGGTGMCMCATCAGCAGCGGTTCGGCGGGTCTGGTGCAGCAGTCGGCCGCGGGCMGGACATCACG 
GTGGCCAAGAACCTGGACGGTGACGCGGTGGACTTCAGCGGCAAGAAGCTGAGTGACAGCACGACGTTTTCGCGCAAGCTGACGGGTG 
TGGCGGAGGGGACGCTGTCGGCGACGAGCACGGATGCGGTGAGCGGCAAGCAGCTCTATACGACCAACCAGAATCTGGCGAGTACCAA 
CMGGACCTGGCCMTACCMCACGCGCCTGACGACGGCCGAGGGCMTCTGTCGTCGMCACGACGAGCATCACGAACCTGCAGAAC 
ACCATCAAGAACATCAGCGGCGGCTCGGCGGGTCTGGTGCAGCAATCGGCTGCGGGCAAGGACATCACCGTGGCCAAGGACCTGGACG 
GTGACGCGGTGGACTTCAGCGGCAAGAACCTGAGCGACAGCACGACGTTCTCGCGCAAGCTGACGGGTGTTGCGGAGGGGACGTTGTC 
GGCGACGAGTACCGATGCGGTGAGCGGCMGCAGCTCTATACGACmCCAGAACCTGAGCACGACGMCra 
MCAAGAGCCTGGCCMGACGMCAACMCGTGTCGGCGACCACGArc^ 
■ GCGGTTCGGCGGGTCTGGTGCAGCAGTCGGCAGCGGGCAAGGACATTACGGTGGCCAAGGACCTGGACGGTGACGCGGTGGACTTCAG 
CGGCAAGAAGCTGAGCGACAGCACGACGTTCTCGCGCAAGCTGACGGGTGTGGCGGAGGGGACGCTGTCGGCGACGAGCACGGATGCG 
GTGAGCGGCAAGCAGCTCTATGCGACGAACCAGAATGTGTCGAAGTTGTCGGCGAACGTCACGGACGTGAGTGATAGCGTCACCAACA 
TCAAGAACACGATGAACACCATCGTGAACGGCGGCGGGCTCAAGTACTTCCACGCGAACTCGACGCTGGACGATGCGCAGGCGATGGG 
CCTCGAGTCGATCGCGTTCGGCGGCGCAGCCGTCGCGGCCGGTATGAACTCGATGGCGATGGGCGGCAATGCCCGGGCGGTGGCGGGC 
AACGCTGTGGCCTTGGGCGCGGGTTCGGTGGCGGACCGCGCGAACACGGTGTCGGTGGGCTCGGCGGGCAAGGAGCGCCAGATCACCA 
ACGTGGCGGCCGGTACGGCGGACACGGATGCCGTGAACGTTGCCCAGCTGAAGGCGGCCGGCATCATCAACGGCAGCGGCAGGACCAA 
CGCCACGGTGACGTACGGCACCAACGCAGACGGCTCGGCGGACTACGGCAACGTGACGCTGGGCGGCGGCMCGCGCCGGCCGGCACG 
GCGATCCACAACGTCGCGGCCGGCACGGCCGAGACCGACGCGGTGAACGTCAGGCAGATGAACGCGGCCATTGCCAGCGTGCAGAAGG 
TGAGCAACACCAACGACCCGATGTTCGCGGCGGATGGCGACCGCGCTGTCAAGCGCGCGAGCGCCAAGGGCACGCATGCCACGGCGAT 
GGGTGCCGCGGCCAGCGCGGGCGGCGACCAGTCGGTCGCGACGGGCCACAACGCGCAGTCGGGCGGCGACAGCTCGGTCGCGATGGGC 
GCGAATGCGAAGGCGACGGCGAATCATGCGGTTGCCGTGGGCTCGGGTTCGGTAGCGAACCGCGCGAACACGATGTCGGTGGGCTCGG 
CGGGCAGCGAGCGCCAGATCACCAATGTTGCGGCCGGTGTGCAGGGTACCGATGCGGTCAACGTGAGCCAGCTGAGCCAGGCGGTCTA 
TGCGGCCGTCGGCGATCTGCCGGCGGGCACGACGGCCAGGCAGTACACGGATGAGCAGATCGGCATGGTGCGGCAGGGGATCAGCCAG 
GTGGCGCGCGGCGCTTACAGCGGTATCGCGGCGGCGACCGCGCTGACGATGATTCCGGACGTCGATCAGGGCAAGTCGATCGCGATCG 
GTATCGGCAGCGCGACCTACMGGGCTATCAGGCGGTTGCGCTGGGCGCCTCGGCACGCATCTCGCACAACCTGAAGGCCAAGATGGG 
CGTGGGCTACAGCAGCGAAGGCACGACGGTCGGCATGGGCGCGTCGTATCAGTGGTAA 

SEQ ID NO: 35 (Sinorhizobium meliloti) 

GTGGCGCTGGGACGCCAATCGGTCTCGGCCGGCAGCGGTTCACTTGCATTCGGTAACGGTAGTTACGCGAATTCAAACGGATCGGTTG 
CCATCGGGCAGTCTGCATACGCGGCCAATGTTCGGGCAATCGCAATCGGTGGAGATGATGCCTTCGCTTGGCGAGAGGCCGAGCAGAC 
CAAGGCCGGGGGCTCCCAGTCGATCGCCATGGGCGTGCGTGCGCGTACGAAATCGCTGGTAGTCGACGACCCGGATACGGTGGCCAAC 
GAAGCGGACCCGGGCGGCGCGTCTGATGCCATTGCAATCGGTACGGATGCACAGGCCMTGGCGACCGGTCTCTCGCCATCGGCAGAC 
AGAATCAGGCCGGCAATGAGCMTCCATCGGCATCGGCGCGGGCAACACGGCAACAGGCAAACTCTCGATCGGCATCGGCAGCAGCAA 
CGTGGCCAGCGGCGAGCAATCCCTAAGCCTCGGCGCCGGCAACAATGCCCTGGGGCAGGGCTCCATCAGCATCGGTACCGAAACCACA 
GCCGGCGGTCTCCGGTCGATCGCTTTCGGCGTGCGCGCGAGTACGAAAGAAGCCAATCTAGATATTCCGGATGACGTGGCCGCGATCG 
ACGCCATCGCTATCGGTACCAATACCAAGGCCAACGGCGACCGGTCCGTCAGCATCGGAACGGGCAGTCAAGCCAGCAGCGGAGCCGT 
CAGTATTGGCGATGCAGCCAAGGCTGTGGGTGACAAATCCGTCAGTATCGGTACCGAAAGCTGGGCCGATGGCGACGAATCGGTCAGC 
ATCGGCCTCGTCAACAACGCCGGGTTTGAAGGGAATGACCGAATCAAAGGCGGGCAAACCTCTGTCAGCCTGGGAGCCTTCAATCAGT 
CGCCGGGCATCGAGGCCATTGCTATCGGTGCCAGAAACGAAGCCAATGCGGATCGGTCGATTGCAATCGGCTCGCGTGCGAAAACGAA 
GGCCGCCGATCCGGCGCAGGCGGACGGCGGTGCGCGCGACGCCGTCGCTATAGGCACGGATGCGCTGGCCAACGACGACCGGTCCATC 
AGCATAGGCTGGAATAGCAGTACCTCCCTGAACGATTCCATCAGCATCGGTACCCGAGCCACGTCGGGATCGGCGGGCGATATCATGA 
TCGGCACAGGTTCAGGGACTGGTTCCACCTCCGGTCAGAACAATGTCGCCCTCGGCGTTGCGGCCAGTCAGAAGGTGAAGGGGTCGTC 
AAACATAGCGATCGGCGATTCGGCGGGCGGTTCCCGGGAAGGCGATAACAACGTCGCCATAGGCACCAATGCGGGAATCCAGTTTTCC 
GAGAGCGAACATGAGACCGCCGTGCGCGCCGACCTCGTGGTCAGTGACGCGGTGAGCATCGGCAATGAGGCGCTGGCGAGCGCCGATG 
AAGCCATCGCAATCGGCACCGGCGCCGTGGCTTCCGGTTTGAAGTCCATCAGCATCGGCGTCGGAAATACCGTCAGCGGCGCTTCGAG 
CGGCGCCATCGGCGATCCGACCGATATCACCGGTACCGGCTCCTACTCGCTGGGCAACGACAACACCATCGCCGCCGACAACGCCGGC 
ACCTTCGGCMCGACMCACATTGGCGGATGCCGCC^ 

TCGTGCTCGGCMTGGTGCCGACGTCACCGAAGTCGGCGGCGTGGCGCTGGGTTCCGGCTCGGTTTCGGATACGGGTGCCGACGTGGC 

GGGGTATGTGCCGGGCGGGGCCTCGACGGCCGACCAAAACGCCATCGAGGCGACGCAGAGCACGCGCGGCGCGGTCGCCGTCGGCAAT 

CCGGACGCCGAAACCGGCGTCTACCGCCAGATCACAGGTGTCGCCGCCGGCACGGCCGACTCCGATGCCGCCAACGTGGCCCAGCTCA 

MTCGGTCGAGACGATCGCCAAGAC^GGCTGGMGCTCACGACGGACAGCGGCAGCATCGACGGC^TCGGGCCGGGC 

CCTCAMGGCGGCGACGGCMTATCGTGATCAGCMTCAGATCT^ 

AACAGGGTGACGGCGAGAGATCCCGACACGGGTGCATCCACGGTGCTGGACGAGAACGGCCTGAGCTTCACGACGCAGGACGCGAACG 
GGGAGGACACGGCGCTCGGCCCGCGCGTGACGGCGGCGGGCATCCAAGCGGCCGGCAAAATCACCAATGTCGCTGCGGGCGAGGCCGA 
CACCGACGCGGTGMCTTTTCCCAGCTCAGGCMGTCGAGACCGCATCGGGCMTACCGACCAGCGGGCGGTCAMTATGACTGGACC 
GACGCCAATACGAATGGCGTGATCGATGAGGGCGAACTCAACCTCGATAGCGTGACCCTTGCCGGGGGCATGGGCGGCACCAGGATCT 
CCAATCTTGCGCCCGGCGCCTTGAGCGCGGCATCGACCGATGCCGTCAACGGCAGCCAGCTCTTCGGGCTTCGCAGCCGCGTATCCAA 
CGTGGCCGTCGCCCTGGGCGGGGGAGCCGCCTATGACCCTGTCAAGGATGAGTGGATCGCCCCGAMTACACGATCGGCGGCACCGAC 
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TACAGCAATGTCGGCGACGCGCTGGCGGCGGTGGGCGGCACGGCCGGTGCCGGCTGGAGCCTCTCGGCGCAGGGTGCGMCGCGTCCA 
ATGTCGCGCCGGGCGAGACGGTGGATCTTCGCAGCGGCGACGGCAATATCGTCGTCAGCAAGGCGGAGACCGGCGACACCGTGAGCTT 
CGACCTGGCCGACGATCTGGACGTCTCCGAAAGCATCACGGTGGGTGCCGATCCCGCCGATCCGAATGCGCCGACCACTGTTATCACC 
GGCGGTTCGATCGTGATCGGCAGCACCATGCTCGGCAGCAACGGCCTGGTCATCACCGGGGGGCCGAGCGTCACGACCGATGGCATCG 
ATGCCGGCGGCATGAAGGTCACGAATGTGGCAAACGGTACGGTGGCGAAGGATTCGAAGGATGCCGTCAACGGCGGCCAGCTCTTCGA 
CGTCGTTGCGAATGCCACTGCGAATGGCGTCGGCTATGACGACAAAAGCAAGGGCACCCTGACGCTGGAGGGGGCTAACGGCACCAAG 
ATCACCAATGTCGCCGCGGGCGACCTGAACGCGAACTCGACCGACGCGGTCAACGGCAGCCAGCTTTACGCCACGAACGTGAAGGTCG 
ATCGGCTCGATACCGAAGTGAAAGAGATCGACAGCCGCGTAACCTATATCGAGAGCTTCCAGGGCGATCTGGAGAACGCTGCCGTCTA 
TGATACCGACGCTGCCGGCAAGAGGCTCAACACGCTGACGCTCGAGGGAGGAGATCCCGACAAGCCGGTGCTCATCGCCAATGTGGCC 
AAGGGCGTGAAGGCGACCGACGCCGTCAATGTCGGCCAGCTCGACGAAAGCGTCGCGGAAAGCAAGAGCTACACGGACGAAAAGACCG 
AGTGGGCGATCGATCAGGCGGCCATCTACACCGACCAGGTTATCGAGACCAAGGTGAGCGCGGTGAACAATTATGCGCAACAACGGTT 
CGCGCAGCTCTCGGGCGAGATCGGGCAGGTTCGGAGCGAAGCGCGGCAAGCCGCCGCCATCGGACTTGCGGCGGCCTCGCTGCGCTTC 
GACAATGAGCCGGGCAAGCTGAGCGTGGCGCTCGGCGGCGGCTTCTGGAGAAGCGAAGGGGCGCTCGCCTTCGGTGCCGGCTACACCA 

GCGMGACGGACGCGTCCGGGCGMCCTGACCGGTGCTGCGGOT^ 
CTGA 

SEQ ZD NO: 36 (Bradorh izobium japonicum) 

GTGCGGGCCTTCGGCTCCGGCMCGCCATCMCGGCACCMCTACGCGGCCGTCGGCTCCMCMCGTCGTCGCCGGCMCMCGGCG 
CGGTTGTCGGCTCCGGCAACGGCGTCACCGGCGACAACACCGCGGCCTTCGGCTCCAGTATCGGCATCGCCGGCGGCAACAACGCGGC 
TGTCGGCTCCTTCAGCACCGTCACCGGCAGCAACAGCGCGGCTGTTGGCTCCTTCAACAACGTCAGCGGCAATAACAGCGGCGCCTTT 
GGCACCGGACAGAACATCCGCGGTAACGGCACGTTCGCCATCGGCGATCCCAACATCGTCAACGGCAACAACAGTTTGGTGTTCGGCG 
ACAACAACACGGTGAACGGGTCCAATGTGGCGGGTCGCGGCGACAACATCCAGCTGGTCGGCTCGAACAACACCATTGCGGCAACGTC 
CAGCGCCGCCGGCTCATCCGTGTTCGGCAGCGGCAACACCGTCAACGCCACCAATGCCGTGGTGATGGGCAACAACAGCACCGTCTCC 
GGCGCGTCCTCTGTCGCGATCGGCAATGGCACGGCCGTTACCGGCATCAACGCGATCGCGATGGGCACCGGCGCCGGCGCGAATTTCG 
ACAACTCGGTCGCGATCGGCAGCGGCGCGACCACGACGCGCGCCAACCAGGTCGCAGTCGGCACCGCCAGCAGCACCTACACGATGAG 
CGGCATCACCTCGGCTGCGAGCAAGGCGGCGCAGTCCGGACCGACCCAGCTCGTCACATCGGACGCGGCCGGCAATCTGGCGACCACC 
TCGCTGGCTGGCCTCGGACTTGCCTCCGCCGGCGACATCAACGGAATCAACTCCCAACTGGCCGCCCTCAACGGCCGCGTCGACAACC 
TGACACGGGAGTCGCGCGGCGGCGTGGCGTTGGCGCTCGCGGCGTCCAGCCTGCAATTCGATCCTCGCCCCGGCAAGATCTCGGTCTC 
CGGCGGCTTCGGCAATTTCCAGGGACAATCCGGCCTCGCGGTCGGCCTCGGCTATTCCTATTCGGACGCCATGCGCTTCAACGCGGCG 
TTCACGGCCGCACAGCAAGGCGCCATCGGCGTCAGGGCCGGCGCGTCGTGGACGCTGAACTGA 

SEQB) NO: 37 (NadA) 

MSMKHFPSKVLTTAILATFCSGALMTSDDDVKKMTV^ 

KKWTNLTKTWENKQNVDAKVKAAESEIEKLTT 

VDKHAEAFNDIADSLDETNTKADEAVKTANEMQTAEE 

KADIAKNSARIDSLDKNVmRmR^ 

SAAYHVGVNYEW 

SEQ ID NO: 38 (YadA) 

MTKDFKISVSAALISALFSSPYAFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVA 

VGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDR 

SKTDRENSVSIGHESLNRQLTHIjy^GTKDTDAVNVAQLKKEIEKTQENTNKRSAELIiANMAYADNKSSSVLGIMN 

NARKEAFAQSKDVLNMAKMSNSVARTTLETAEEHMSVARTTLETAEEHMKKSAEALASANVYADSKSSHTLKTMS 

TKKAIRESNQYTDHKFRQLDNRLDKLDTRVDKGLASSAALNSLFQPYGVGKVNFTAGVGGYRSSQALAIGSGYRVNENVM 

GSSDVMYNASFNIEW 

SEQ ID NO: 39 (Consensus) 

MXXMKXXKXXLLAXAIXAXFSXGALAMTXDXXXTGPEAXXVXIXPXAXXXLXXXXX 
GLXXXXXXPNIXTXXXXXQTKDQIMLIRXXXNLXENKXXVXXXVMIKXIPKDLIAKXADVDXX 

SHVAMHGYSIAIGDRSKTDRENSVSIGHESLNRXLXXmmEEXXENIAQIDXNXEQXXEXXDmEXXraXXXLAXXmm 
XAXXVNLXmXXXXmXXTTXNXLXQKIAEXKXNXDAXKXKmXXXXmXXXmXXm 
XTXXIATEXSXXFEXXXXKXXQXXQXIMNXTXVAIXEQXIXXNTARIDXLDNRVNXLDKEXKAGLASQMLSGLFQPYNV 
AVGGYXSXXAXAIGXGSXRFNENXAAKAGVAXDTQXGGSSAGYXVGVNFEW 

5 SEQ ID NO: 40 (primer) 

TATCGGCAAACGACGCAAGC 

SEQ ID NO: 41 (primer) 

GGGCGATTAGCCATTGATAC 
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SEQ W NO: 42 (primer) 

AACGGTTGATGCCGCACTAG 

SEQ ID NO: 43 (primer) 

GTGTTGATAGGTGGCACTACTG 

SEQ ID NO: 44 (primer) 

GCAGAGAAAGCAAMGGTGATTC 

SEQ ID NO: 45 (primer) 

CAAAATTCTGCAGACATCGCMC 

5 SEQ ID NO: 46 (primer) 

CAAACTGCAGTTGCTGTAGG 

SEQ ID NO: 47 (primer) 

ACCTACAGCAACTGCAGTTTG 

SEQ ID NO: 48 (primer) 

CAACTCCCTCTTCTAAAGCTG 

SEQ ID NO: 49 (primer) 

AGTAGTGCCACCTATCAACAC 

SEQ ID NO: 50 (HadA) 

CGACGCAAGCCMGTAATAGTAATATTTAATTAGGTATGATGTAAATTCTGCTTGAGGCAAATTTTACATAGGAAATTTTTCTATATT 
GCTTTAACGTTTTTTTATAGTAGAAGTATATACTCAGTTATGGTTATGGTTACATAGTATAGTTTTACTTTGTTCTAGTTCACTTTAA 
TAACCTTAAATAATTGAGGATTTCTTATGAAAAGAAATTTATTAAAACAATCTGTAATCGCTGTGTTGATAGGTGGCACTACTGTTTC 
TMTTATGCTTTAGCACAAGCACMGCAC^ 

GCTGCTATCGATGGTATTCTTGATGATMTATTGCTTATGAAGCTGAAGTTGATGCAAAACTTGATCAGCATTCTGCTGCTCTTGGTA 
GACATACAAATAGACTCAATAATCTTAAAACGATTGCAGAGAAAGCAAAAGGTGATTCAAGTGAAGCACTTGATAAAATTGAAGCTCT 
TGAAGAACAAAATGATGAGTTTTTAGCGGATATTACAGCTTTAGAAGAGGGAGTTGATGGTTTAGATGATGATATCACAGGTATTCAA 
GATAATATTTCTGATATAGAAGATGATATTAATCAAAATTCTGCAGACATTGCAACTAACACAGCGGCAATCGCAACTCACACTCAAC 
GTCTTGATAATTTAGATAACAGAGTAAATAACCTTAATAAAGATCTTAAACGTGGTCTTGCTGCTCAAGCTGCATTAAATGGTTTATT 
CCAACCGTATAACGTAGGTAAATTAAATCTTACTGCTGCTGTAGGTGGTTATAAATCTCAAACTGCAGTTGCTGTAGGTACTGGTTAT 
CGTTATAACGAAAATATCGCGGCTAAAGCAGGTGTTGCTTTCACTCATGGTGGCAGCGCAACTTATAATGTTGGCGTAAATTTTGAAT 
GGTAATTAGATAACTAATTTTCCATAACAGAAAATAAATACCTGTTTTTGAGTAATATCAGAAACAGGTATTTTTTTATAGGCTTCGT 
TTCGCACACTCGTTACTAGTGTGGATATGTGAATAAAATCAATAATATTTGGAGTATTTCATCTATTTTATTAATTTTGTTAGCGGAT 
AAAATAACTTTCTGTGTGTTCCTCCAAATTCTTTAAATTCAATAAACCCTAATCGTTCATAAAACGAGCTGGCATCATCATTTTTTGC 
TTCAACGACTAAAATTGTAGCAGCTACCGTAGCATTCTTAATCTTATGAATGGCGTCTGCAATTAGAAACTTTCCATACCCTTGCTTC 
TGGAAATTCGTATCAATGG 

10 SEQIDNO: 51 (HadA) 

MKRNLLKQSVIAVLIGGTTVSNYAIAQAQAQAQVKK^ 

KTIAEKMGDSSEALDKIEALEEQNDEFLADITALEEGVE)GLDDDITGIQDNISDIEDDINQNSADIATNTAAIATHTQRLDNLDNRV 
NNLNKDLKRGLAAQAALNGLFQPYNVGKLNLTAAVGGYKSQTAVAVGTGYRYNENIAAKAGVAFTHGGSATYNVGVNFEW 

SEQ ID NO: 52 (HadA C-terminus) 

TGYRYNENIAAKAGVAFTHGGSATYNVGVNFEW 

SEQIDNO: 53 (Gene downstream of HadA) 

ATGATAAATGAAAATTTAGCATATTTAAGTGTATTACCCTTAGAAGATGTAAAGATTGAGAGAAGTTCATTTTCTTGTTCAGTTGAAC 
CTTTAGAAAACTACTTTCATAAGTACGTTTCTCAAGATGTAAAGAAAGGGCTTGCAAAGTGTTTTGTGCTTATAAATGCACAACCATC 
TAGGATTGTTGGCTATTACACTTTATCGGCATTATCAATACCAATTCCAGATATACCCCAAGAACGAATAAGTAAAGGCGTACCATAT 
CCTAATATTCCTGCTGTTTTAATAGGGCGATTAGCCATTGATACGAATTTCCAGAAGCAAGGGTATGGAAAGTTTCTAATTGCAGACG 
CCATTCATAAGATTAAGAATGCTACGGTAGCTGCTACAATTTTAGTCGTTGAAGCAAAAAATGATGATGCCAGCTCGTTTTATGAACG 
ATTAGGGTTTATTGAATTTAAAGAATTTGGAGGAACACACAGAAAGTTATTTTATCCGCTAACAAAATTAATAAAATAG 

SEQ ID NO: 54 (Sequence encoded by SEQ ID NO: S3) 

MINENLAYLSVLPLEDVKIERSSFSCSVEPLENYFHKYVSQDVKKGLAKCFVLINAQPSRIVGYYTLSALSIPIPDIPQERISKGVPY 
PNIPAVLIGRLAIDTNFQKQGYGKFLIADAIHKIKNATVAATILVVEAKNDDASSFYERLGFIEFKEFGGTHRKLFYPLTKLIK 
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SEQ ID NO: 55 (Histone acety [transferase) 

MINENLAYLSVLPLEDVKIERSSFSCSVEPLENYFHKYVSQDVKKGLAKCFVLINAQPSRIVGYYTLSALSIPIPDIPQERISKGVPY 
PNIPAVLIGRLAIDTNFQKQGYGKFLIADAIHKIKNATVMTILWEAKNDDASSFYERLGFIEFKEFGGTHRKLFYPLT 

SEQ ID NO: 56 

GCAAGCCAAGTAACAGTAATGTTTAATTAGGTATGATTTAAATTCTGTTTTATATCACACTAGCAATGTGGGTTTCTTGTATTGGTAT 
TAACTAAATTACGCATTAATAAAGCGTAATTTAAGTTAATATCTTGTGGTACATTTAAGAATACAAAATGCCCATCACCTAGTG 

SEQ ID NO: 57 

GCAAGCCAAGTAACAGTAATGTTTAATTAGGTATGATTTAAATTCTGTTTTATATCACACTAGCAATGTGGGTTTCTTGTATTGGTAT 
TAACTAAATTACGCATTAATAAAGCGTAATTTAAGTTAATATCTTGTGGTACATTTAAGAATACAAAATGCCCATCACCTAGTG 

SEQWNO:58 

GCAAGCCAAGTAACAGTAATGTTTAATTAGGTATGATTTAAATTCTGTTTTATATCACACTAGCAATGTGGGTTTCTTGTATTGGTAT 
TAACTAAATTACGCATTAATAAAGCGTAATTTAAGTTAATATCTTGTGGTACATTTAAGAATACAAAATGCCCATCGCCTAGTG 

5 SEQWNO:59 

GCAAGCCAAGTAACAGTAATGTTTAATTAGGTATGATTTAAATTCTGTTTTATATCACACTAGCMTGCGGGTTTCTTGTATTGGTAT 
TMCTAAATTACGCATTMTAAAGCGTMTTTMGTTMTATCTTGTGGTACATTTAAGMTACAAAATGCCCATCGCCTAGTG 

SEQWNO:60 

GCAAGCCAAGTAACAGTAATGTTTAATTAGGTATGATTTAAATTCTGTTTTATATCACACTAGCAATGCGGGTTTCTTGTATTGGTAT 
TAACTAAATTACGCATTMTAAAGCGTMTTTAAGTTMTATCTTGTGGTACATTTMGAATACAAMTGCCC^TCGCCTAGTG 

SEQIDNO:61 

GCAAGCCMGTAACAGTAATGTTTAATTAGGTATGATTTAAATTCTGTTTTATATCACACTAGAAATGAGGATTTCTTGTATTGGTAT 
TAACTAAATTACGCATTAATAAGGCGTAATTTAAGTTAATATCTTGTGGCACATTTAAGAATACAAAATGCCCATCGCCTAGTG 

SEQIDNO:62 

TTAGGTATGATTTAAATTCTG 

SEQ ID NO: 63 

ATAGTATAGTTTTACTTTGTTCTAGTTCACTTTAATAACCTTAAATAATTGAGGATTTCTTATGAAAAGAAATTTAT 

10 SEQ ID NO: 64 

CCGACGCMGCCMGTMTAGTMTATTTMTTAGGTATGATGTAAATTCTGCTTGAGGCAAATTTTACATAGGAAATTTTTCTATAT 
TGCTTTAAGGTTTTTTTATAGTAGAAGTATATACTCAGTTATGGTTATGGTTACATAGTATAGTTTTACTTTGTTCTAGTTCACTTTA 
ATMCCTTAAATAATTGAGGATTTCTTATGAAAAGAAATTTATTAAAACAATCTGTAATCGCTGTGTTGATAGGTGGCACTACTGTTT 
CTAATTATGCTTTAGCACAAGCACAAGCACAAGCAC 

SEQIDNO.65 

AGGATACGAAAAATATCGGCAAACGACGCAAGCCAAGTAACAGTAATGTTTAGGCTTGTATAGTATAGCTTTGCTTTGTTCTAGTTCA 
ATTTMTMTCTTAAATAATTAAGGATTTCTTATGAAAAAAAATTTATAGGCTTCGTTTCGCACACTCGTTGCTAGTATAGATATGTG 
AATA 
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